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(54) Lumazine synthase and riboflavin synthase 

(57) Through function complementation of E. cod 
auxotrophs, the ultimate and pentultimate enzymes of 
the spinach riboflavin biosynthetic pathway have been 
cloned, namely, lumazine synthase (LS) and riboflavin 
synthase (RS). This invention relates to the isolation of 
nucleic add fragments from plants or fungi that encode 
LS protein. The invention also relates to the Isolation of 
nucleic acid fragments from plants or fungi that encode 
RS protein. In addition, the invention also relates to the 
construction of chimeric genes encoding all of a portion 
of LS, in sense or antisense orientation, wherein the 
ejqDression of the chimeric gene results in production of 
altered levels of plant LS in a transformed host cell. Fur- 
thermore, the invention also relates to the construction 
of chimeric genes encoding all of a portion of RS. in 
sense or antisense orientation, wherein the expression 
of the chimeric gene results in production of altered lev- 
els of plant or fungal RS in a transformed host ceil. In 
viyo and in vitro methods to identify herbicide or fungi- 
cide candidates are included that evaluate the ability of 
a chemical compound to inhibit the activity of a plant or 
fungal LS enzyme or a plant or fungal RS enzyme. 
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Description 

[0001 J This invention is in the fieW of plant and fungal molecular biology. More ^ecifically, this invention pertains to 
nudec acid fragments encoding proteins involved in the riboflavin biosynthetic pathway of plants or fungi. 

5 [0002] Rlt>oflavia vitamin B2. is the precursor of flavin mononudeotide (FMN) and flavin adenine dinucleotide 
(FAD), essential cofactors for a number of nfiainstream metabolic &\zymes that mediate hydride, oxygen, and electron 
transfer reactions. Riboflavin-dependent enzymes include succinate dehydrogenase, NADH dehydrogenase, ferre- 
doxin-NADP* oxidoreductase. acyl-CoA dehydrogenase, and the pyruvate dehydrogenase complex. Consequently, 
fatty acid oxidation, the TCA cycle, mitochondrial electron-transport photosynthesis, and numerous other cellular proc- 

70 esses are aitically dependent on either FMN or FAD as prosthetic groups. Other notable f lavoproteins include glutath- 
ione reductase, glycolate oxidase. P450 oxido-reductase. squalene epoxidase. dihydroorotate dehydrogenase, and a- 
glycerophosphate dehydrogenase. Genetic disruption of riboflavin biosynthesis in E. cofi (Richter et al., J. BacterioL 
174:4050-4056 (1992)) and S. cerevisiae (Santos et al.. J. Biol. Chem. 270:437-444 (1995)) results in a lethal pheno- 
type that is only overcome by riboflavin supplementation. This is not surprising, considering the ensemble of deleterious 

15 pletotrx3pic effects that vvould occur vWth riboflavin depri^^ 

[0003] Riboflavin is synthesized by plants and numerous microorganisms, including tacteria and fungi (Bacher. A. 
Chemistrv and Biochemistry of Flavooroteins (MQIIer, R. ed.) vol. 1, pp. 215-259, Chemical Rubber Co., Boca Raton. 
FL (1990)). Since birds, mammals, and other higher organisms are unable to synthesize the vitamin and, instead, rely 
on its dietary ingestion to meet their metabolic needs, the enzymes that are responsible for riboflavin biosynthesis are 

so potential targets for future antibiotics, fungicides, and herbicides. Moreover, it is possible that the distantly-related plant 
and microbial enzymes have distinct characteristics that could be exploited in the development of potent organism-spe- 
cific inhibitors. Thus, a detailed understanding of the structure, mechanism, kinetics, and substrate-binding properties 
of the riboflavin biosynthetic enzyme(s). from plants for example, would serve as a starting point for the rational design 
of chemical compounds that might be useful as herbicides. Having the authentic plant protein(s) In hand would also pro- 

25 vide a valuable tool for the in vitro screening of chemical libraries in search of riboflavin biosynthesis inhibitors. 

[0004] Bacterial and fungal riboflavin biosynthesis has been intensively studied for more than four decade (For 
recent reviews, see Bacher, A., Chemistrv and Biochemistry of Flavooroteins (Mulller. F.. ed.) vol, I. pp. 215-259 and 
293-316 Chemical Rubber Co.. Boca Raton. FL (1990)). The synthetic pathway consists of seven distinct enzyme cat- 
alyzed reactions, with guanosine 5'-triphosphate (GTP) and ribulbse 5-phosphate the ultimate precursors. While the 

30 second and third steps of riboflavin biosynthesis occur in opposite order in bacteria and fungi, the remaining pathway 
intermediates are identical in both microorganisms. Structurally and mechanistically, the last two reactions in the path- 
way, namely, those catalyzed by 6,7-dimethyl-8-ribityllumazine synthase (LS) and riboflavin synthase (RS), are best 
characterized. In B. subtilis, these two enzymes are physically associated with each other in a huge spherical particle 
with a combined molecular mass of about 1 MDa (Bacher et al.. J. Biol Chem. 255:632-637 (1980): Ritsert et al., J, Mol. 

35 Biol. 253. 151-167 (1995); Bacher et al., Biochem. Soc. Trans. 24(1):89-94 (1996)); the X-ray structure of the bifunc- 
tionat protein complex has been determined at 3.3 angstrom resolution (Ladenstein et al.. J. Mol. &/b/ 203:1045-1070). 
The LS/RS complex consists of 60 LS subunits that are organized into 12 pentamers to form a hollow icosahedral cap- 
sld. Encaged in the central core of this structure resides a single molecule of RS, a trimer of three identical subunits. 
Kinetic studies reveal that the compartmentation of the two enzymes within the complex improves the overall catalytic 

40 efficiency of riboflavin production at low substrate concentrations, pi-esumably via "substrate channelirig" (Kis et al., J. 
Biol. Chem. 270:16788-16795 (1995)). Although a bifunctional LS/RS complex has not been observed in other micro- 
organisms, it was recently shown that the native £. coif LS also exists in v/vo as a hollow icosahedral capsid of 60 iden- 
tical subunits (MOrtl et al., J. Biol. Chem. 271 :33201-33207 (1996)). 

[0005] LS, the penultimate enzyme of ritxTflavin biosynthesis, catalyzes the condensation of 3.4-dihydroxy-2- 
45 butanone 4-phosphate with 4-ribrfylamino-5-amino-2,6-dihydroxypyrimidine (RAADP) to yield 1 mol each of orthophos- 
phate and 6,7-dimethyl-8-(V-D-ribityl)-lumazine (DMRL). The latter is the immediate precursor of riboflavin. LS-encod- 
ing genes have been cloned from numerous microorganisms, including E. coli (Taura et al., MoL Gen.Genet. 234:429- 
432 (1992)). A pfeuropneumoniae (Fuller et al., J. BacterioL 177:7265-7270 (1995)), P. phosphoreum (Lee et al., J. 
BacterioL 1762100-2104 (1994)), B. subtilis (Mironov et al.. DokL Akad. Nauk SSSR 305:482-487 (1989)), and S. 
so cerevisiae (Garcia- Ramirez et al., J. BioL Chem. 270:23801 -23807 (1 995)). In ail cases, the subunit molecular mass of 
the LS gene product is small, ranging in size from ~16-17 kDa. \/Vhile the various LS homologs all share certain struc- 
tural features in common, their overall homology at the primary amino acid sequence level is rather poor. For example, 
as determined with the Genetics Computer Group Gap program (Wisconsin Package Version 9.0, Genetics Computer 
Group (GCG). Madison. Wise), the E. coli LS is only 58%. 65%. 53%. and 36% identical to the homologous proteins 
55 of A. pleuropneumoniae, R phosphoreum, B. subtilis and S. cerevisiae, respectively. Indeed, painwise comparisons of 
these five proteins reveal that the two most similar homologs share only 72% identity. 

[0006] The terminal step of n'bof lavin biosynthesis is mediated by RS. This enzyme catalyzes the dismutation of two 
molecules of DMRL to yield 1 mol of riboflavin and RAADR That the latter product is also one of the substrates of LS 
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637 (1980)) and fungal (SaSL «T"1^r^^^ 

Identical "25 kDa subunfts. To date. RS has only been clonStemS JT^ ■ 

cies that have been examined exhibit mafidZ^mt?H^^^^^ microorganisms, and all of the spe- 

tions. ithasbeen suggested th« the twh^^^ fiWrif ! 270:437-444 (1995)). Based on these observa- 

of the various RS proteins is extremely limrted Xs Z ^"^'!'/'""«^'*y' ^^^r. the oveiall sequence homology 
to its counterparts in S. cerev^alTp Ztl^^ iS^^ '"^ 
the latter four proteins are Z21621 . L1 1391 X515^ St^e^Set """^"^ "'^ 

Sg Zn^'r^e^bTC^erm^^^^ 

Cloned from an arabidopsis cDNA ^^TilS^^Tsl ler^-iSfoJ^^^ 

cloned plant gene is only 37-58% identk»l to #Thr,Zjr:J, 160-303-304 (1995)). The protein sequence of the 
p/7aspo,eu/n.'while fulWength^NA ^^enc*^^^ ^"^^^^^ '^'^"^"'^ and 

enzyme, the GenBankdatalSseconteinTt^r^^^^^ P'^"* '^'^ "''^y^etic 

SS?, """^^ "'^^^ '^^-^ ''^^^^ ''^^ '-^^^^^ 

residues. The arabidopsis cDNA doneTGen^nk acSss^^^ *° '^^ ^"♦^^"^'^ ^ 

a BLAST (Basic LoS Alignment sirch T^ fSh^^^^^ 

TBLASTN algorithm providXlelSJ^SL;^^!^^^^^ 215:403-410 (1990)) search using tSe 

BLAST search vvas t^e translatL I. cSnTg^Totn^^ 

Similarity to the ambldopsis EST was^Io ^ t^SSS th^^rn"^?"" l^^^ ^ P'^"'*^ 
tained only the last 26 C-tem,ina. amino acTd reSi^te p^^n S n?r«rS;*J!* ^"^"^ 
length cDNA clone. Since neither of these clones pSsesH Sa h v "T^*" ^^^^^^ '* a partial or fun- 
s'^ Tc^'n^Tt^ 

detected in extracts from variousX 2ecS G^^^^ ""^^ ^as been 

Academic Press. N«v YorK (1961)) ^^ S^Jl (Greenberg. D.M.. ed.). vol II. p. 673. 

Methods Enzymol. ItorSSgi^Jigiw ^ 

. Sltlo^X^ttcSt^^^^ 

amino acid sequence mSlfs^ncSoMj^JundL^^ ohgrcnucleotde primers con-esponding to conserved 

nately. these techniques ^S^l^^^SVbT^Zl^^ f"*^ '"'^^^'^ Unfortu- 

all hea^.■|y rely on the presence of si^mSSL '(T; "?nTS all'^^ '""^^ 

te,ns and genes that havefhe same Lotion. Gisf^SL^^^Za^^ h oc '^"^"''^^ P^*" 

served. even amongst microorganisms, irhiST^nTte^rSTS^ ^-^"^ are both so pooriy con- 

structural s.milariti J^ith thelrSu^rpi^s in^her Ss^ '"""'"^^ ^^^'^"^ 

z:^ eucaX^stjuysXe,::^^^^^^^^ s:n'nr ^2"- ^^^y^ 

Since this strategy relies iy o7fhe7nc«^^ are defiaent in the enzyme activity of Interest, 

and the target gene of interest it is ideaMvsurt!^ ^^1^ ^ ® P™*®'" ^ disrupted host gene 

«on.ForJnc.iLlco,:JreS.4«o^^^^^^^^^^^ 

CDNA in the microbial host The plasmid ItoraVy fe ftX^^^ ^" ^'^^ expression of the 

that are no longer phenotypically ^nt tJ^ r^SSj? ^'-^ '""=^°^^- ^ «"°"'«« s«'«ct«l 

(1995)) and RS (Santos iai.. J SjS,i^ 2lal?7 ^i?^^^^^^^^^^ ^- '^^ 27053801-23807 

(Kobayashi et al. Gene 1 60:303-^4 ^^^ggl^^were^^^^^ f yeast, and arabidopsis GTP cyclohydrolase II 

auxotrophs. This strategy has aCvirS J?^ complementation of microbial riboflavin 
bolicpathways. including lysine ttoSSi^FS^ '^^"^ ^'^'^^ '"^^ 

(Aimi et al.. J. Biol. Crtem. les^l^^miSSTa?^^^^ 228287-293 (1991)). purine biosynthesis 
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malate dehydrogenase (EOerstrom et al.. Plant MoL Biol 18:557-566 (1992)). and dihydroorotate dehydrogenase 
(Minet el a!.. Plant J. 2:417-422 (1992)). 

[001 2] Despite the obvious attractive features of cloning by functional complementation, there are several reasc^is 
why this approach might not work when applied to the higher plant LS and RS genes. First, the eucaryotic cDNA 

5 sequence might not be expressed at adequate levels in the mutant microbe for a variety of reasore. including differ- 
ences in prefenred codon usage. Second, the doned eucaryotic gene might not produce a functional polypeptide, if for 
instance, enzyme activity requires a post-translational mocfif ication, such as acetylation. glycosyfation, or phosphoryla- 
tion that is not carried out by the microbial host. Third, the heterologous plant protein might be lethal to the host, thus 
rendering its expression impossible. Fourth, the eucaryotic protein might fail to achieve its native conformation in the 

JO foreign microbial environment, due to folding problems, inclusion body fomiation, or various other reasons. It is also 
possible that the high^ plant LS and RS enzymes are nuclear-encoded proteins that are posttransiationally targeted to 
chloroplasts. mitochondrial, or some other organelle that is not present in the microbial host If this were the case and 
proteolytic removal of the organeliar targeting sequence was required for enzyme activity, cloning these genes by func- 
tional complementation would not be possible. 

75 [0013] The instant invention relates to isolated nucleic add fragments encoding plant or fungal enzymes involved 
in riboflavin biosynthesis. Specifically, this invention concems isolated nucleic acid fragments encoding a plant or fungal 
LS, particularly wherein the plant is spinach, tobacco or arabidopsis and the fungus is Magnaporthe grisea. This inven- 
tion also concerns isolated nudeic acid fragments encoding a plant or fungal RS. particularly wherein the plant is spin- 
ach or arabidopsis and the fungus id Magnaporthe grisea. In addition, this invention relates to nucleic add fragments 

20 that are complementary to nudeic acid fragments encoding a plant or fungal LS enzyme or a plant or fungal RS 
enzyme. 

[0014] Specific isolated nucleic acid fragments encoding a plant LS enzyme are (a) an isolated nudeic add frag- 
ment encoding all or a substantial portion of the amino acid sequence selected from the group consisting of SEQ ID 
NO:2, SEQ ID N0:4 and SEQ ID N0:6; (b) an isolated nudeic add fragment that is substantially similar to an isolated 

25 nudeic acid fragment encoding ail or a substantial portion of the amino acid sequence selected from the group consist- 
ing of SEQ ID NO:2. SEQ ID N0:4 and SEQ ID N0:6; (c) an isolated nucleic acid fragment encoding a polypeptide hav- 
ing at least 72% identity with the amino acid sequence selected from the group consisting of SEQ ID N0*2. SEQ ID 
NO:4 and SEQ ID N0:6: and (d) an isolated nucleic add fragment that is conplementary to (a), (b) or (c). 
[0015] Special isolated nucleic acid fragments encoding a plant RS enzyme are (a) an isolated nudeic acid frag- 

30 ment encoding all or a substantial portion of the amino acid sequence selected from the group consisting of SEQ ID 
N0:8 and SEQ ID NO:10; (b) an isolated nudeic acid fragment that is sutDStarrtially similar to an isolated nudeic acid 
fragment encoding all or a substantial portion of the amino add sequence selected from the group consisting of SEQ 
ID N0:8 and SEQ ID NO:10; (c) an isolated nucleic acid fragment encoding a polypeptide having at least 70% identity 
with the amino acid sequence selected from the group consisting of SEQ ID N0:8 and SEQ ID NO:10; and (d) an iso- 

35 lated nucleic add fragment that is complementary to (a), (b) or (c). 

[001 6] Specific isolated nudeic add fragments encoding a fungal RS enzyme are (a) an isolated nucleic acid frag- 
ment encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID N0:12; (b) an isolated nudeic 
acid fragment that is substantially similar to an isolated nucleic acid fragment encoding all or is homologous to at least 
a substantial portion of the amino add sequence set forth in SEQ ID N0:12; (c) an isolated nucleic acid fragment that 

40 is complementary to (a) or (b). 

[0017] Specrfic isolated nucleic acid fragments encoding a fungal LS enzyme are (a) an isolated nudeic acid frag- 
ment encoding all or a substantial portion of the amino acid sequence set forth in SEQ ID NO:38; (b) an isolated nucleic 
acid fragment that is substantially similar to an isolated nucleic acid fragment encoding all or is homologous to at least 
a SLd3Stantial portion of the amino add sequence set forth in SEQ ID NO:38; and (c) an isolated nucleic add fragment 

45 that is complementary to (a) or (b). 

[0018] In another embodiment, the instant invention relates to chimeric genes encoding a plant or fungal LS 
enzyme or a plant or fungal RS enzyme or to chimeric genes that comprise nucleic add fragments that are complemen- 
tary to the nudeic add fragments encoding the enzymes, operably linked to suitable regulatory sequences, wherein 
expression of the chimeric genes results in production of levels of the encoded enzymes in transformed host cells that 

so are altered (i.e.. increased or decreased) from the levels produced in the untransformed host cells. 

[001 9] In a further embodiment, the instant invention concems a transformed host cell comprising in its genome a 
chimeric gene encoding a plant or fungal LS enzyme or a plant or fungal RS enzyme, operably linked to suitable regu- 
latory sequences, wherein expression of the chimeric gene results in production of altered levels of a plant LS enzyme 
or a plant or fungal RS enzyme in the transformed host cell. The transformed host cells can be of eucaryotic or procary- 

55 otic origin, and include cells derived from higher plants and microorganisms. The invention also indudes transformed 
plants that arise from transformed host cells of higher plants, and from seeds derived from such transformed plants. 
[0020] An additional embodiment of the instant invention, concerns a method of altering the level of expression of 
a plant or fungal LS enzyme or a plant or fungal RS enzyme in a transformed host cell comprising: a) transforming a 
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cr« p^teceon chertcal coowlls the Sr*T<^ST " = """"W »Kyme and thus 

nost cell of step (a) with a chimeric gene comDrisina an isniatoH^ , W transforming the altered microbial 

"^^"^^ »^?2S;:Xe encoding a plant or fun^ls 

that allows its expression in the microbial host cell- nmZ,^^ ^. ^ ®* ="'*abie regulatory seauence 

m both the presence and absence of added nbof lavin- 7e We^^n^^ ? ^ ® srowiVig exponentially and 

compound of interest that inhibits growth ^J^'*^^^ or fungicide candidate the <*aj^ 

added riboflavin. Suitable isolated nudefeaid ta^!^ I '''^ ""'^ 9rown in theabseSHf 
» v.n^ assay include the £ «,// LS and S'itXt ST^fooi^ iT! -f^"- ^"'^^'^ ho^te for^ t 

^ ;-rT:n°Lrr^^^^^^^ 

didateachemicalcompoundthatirto^^^^^^ as an herbicide or fungic«le can- 

pnsmg a nude,c add fragmert encoding a pl^ or £ne^^^ 
operably linked to at least one suitableTegul^rv s^^^ 
djfons suitable for expression of the ciSS^^nr^^^^ 

plant or fungal RS enzyme; (c) purifying the o^br q Production of the plant or fungal LS enzyme or a 

JeJansfom,«| host c^l; ^iaLS^S^^Sc^e^'::; enzyme ^SedbJ 

bicde or fungicide candidate the cheLi S^u^^ treSf iStT.""'' WentifyinJ^^r- 
enzyme or plant or fungal RS enzyme relativVto^rachVi^*^!!! T ^^"^ """^ °^ P'^"* °? f"ngal LS 
compound of interest Sudi reduced activity inSites 2Xf rhi^^f*^® ^"^^ ^^ence of the *em.4i 
tect^n d,emica.. Suitable isolated nucl^tS^:^J:^'^j;:^ "sefu. as a cr^Z 

" Scs ^e;-<^::Zi:Zn!:r ^^"'"^"^ ^ ^ P«eup prog^m 

^ Sn^ R?XrrXrr^"° ^^'^ ^'^--^^ of ^ .crob^ «s h^ogs and ^ doi^ 

Figure 4 shows primary amino add sea2r««Siri 5 ®^ Peptides). 

Sequence D^ptons oonS/nT,. on, Wte^S* toS«15?^ """O S^">a« ST sTSS 

en.TO adds es defined In oortbnwty wmiZ SpIX?;^'?"? chaiaclers end the Mae leoer ocAs to! 
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SEQ ID N0:1 is this nucfeotlde sequence of a doned cDNA encoding a mature spinach IS. 

SEQ ID N02 is the deduced amino ackl sequence of the cloned cDNA encoding a mature spinach LS. 

SEQ ID N0:3 is the nucleotide sequence of a doned cDNA encoding a mature tobacco LS. 

SEQ ID N0:4 is the deduced amino acid sequence of the cloned cDNA encoding a mature tobacco LS. 
5 SEQ ID N0:5 is the nucleotide sequence of a doned cDNA encoding a mature arabidopsis LS. 

SEQ ID N0:6 is the deduced amino acid sequence of the cloned cDNA encoding a mature arabidopsis LS. 

SEQ ID N0:7 is the nudeotide sequence of a doned cDNA encoding a mature spinach RS. 

SEQ ID N0:8 is the deduced amino add sequence of the doned cDNA encoding a mature spinach RS. 

SEQ ID N0:9 is the nudeotide sequence of a doned cDNA encoding a rrature arabidopsis RS. 
10 SEQ ID NO:10 is the deduced amino acid sequence of the cloned cDNA encoding a mature arabidopsis RS. 

SEQ ID NO:l 1 is the nucleotide sequence of a cloned cDNA encoding a Magnaporthe grisea RS. 

SEQ ID N0:12 is the deduced amino acid sequence of the cloned cDNA encoding Magnaporthe grisea RS. 

SEQ ID N0:13 is the 5' primer useful in the amplification of E. coli LS having Qenbank accession No. X64395. 

SEQ ID N0:14 is the 3' primer useful In the amplification of E. coli LS having Genbank accession No. X64395. 
15 SEQ ID N0:15 is the 5' primer useful in the amplification of E coii RS having Genbank accession No. X69109. 

SEQ ID N0:16 is the 3' primer useful in the ampirication of E. coli RS having Genbank accession No. X69109. 

SEQ ID NQ:1 7 is the 5' primer useful for the introduction of a DNA fragment that confers kanamycin resistance into 

the £. coli LS and RS genes having Gent>ank accession Nos. X64395 and X69109. respectively, at a Noti cleav- 
age site. 

20 SEQ ID NO:18 is the 3' primer useful for the Introduction of a DNA fragment that confers kanamycin redstance into 
E. coli LS and RS genes having Genbank accession Nos. X64395 and X69109, respectively, at a Noti deavage 
site. 

SEQ ID N0:19 is one of the PGR primers useful for the introduction of a Nott deavage site in the middle of E coli 
LS having Genbank accession No. X64395 (hybridizes to nt 2273-2290). 
25 SEQ ID NO:20 is one of the PGR primers useful for the introduction of a Noti cleavage site in the middle of E. coli 
LS having Genbank accession No. X64395 (hybridizes to nt 2243-2261). 

SEQ ID N0:21 is one of the PGR primers useful for the introduction of a Noti cleavage site in the middle of £. coll 
RS having Genbank accession No. X69109 (hybridizes to nt 1217-1233). 

SEQ ID NO:22 is the one of the PGR primers useful for the introduction of a Noti deavage site in the middle of E 

30 coli RS having Genbank accession No. X691 09 (hybridizes to nt 1 190-1 208). 

SEQ ID NO:23 is the 5' primer useful for the removal of the transit peptkie from the cloned spinach RS precursor. 
SEQ ID NO:24 is the 3' primer useful for the removal of the transit peptide from the doned spinach RS precursor. 
SEQ ID NO:25 is the 5' primer useful for the removal of the transit peptide from the cloned spinach LS precursor. 
SEQ ID NO:26 is the 3" primer useful for the removal of the transit peptide from the cloned spinach LS precursor. 

35 SEQ ID NO:27 is the nucleotide sequence of a cloned cDNA encoding a spinach LS precursor with its transit pep- 
tide. 

SEQ ID NO:28 is the deduced amino acid sequence of the doned cDNA encoding a spinach LS precursor with its 
transit peptide. 

SEQ ID N0:29 is the nucleotide sequence of a cloned cDNA encoding a tobacco LS precursor with its transit pep- 
40 tide. 

SEQ ID NO:30 is the deduced amino acid sequence of the doned cDNA encoding a tobacco LS precursor with its 
transit peptide. 

SEQ ID N0:31 is the nucleotide sequence of a cloned cDNA encoding an arabidopsis LS precursor with its transit 
peptide. 

45 SEQ ID NO:32 is the deduced amino acid sequence of the doned cDNA encoding an arabidopsis LS precursor 
with its transit p^tide. 

SEQ ID NO:33 is the nudeotide sequence of a cloned cONA encoding a spinach RS precursor with its transit pep- 
tide. 

SEQ ID NO:34 is the deduced amino acid sequence of the doned cDNA encoding a spinach RS precursor with its 
so transit peptide. 

SEQ ID NO:35 is the nucleotide sequence of a cloned cDNA encoding an arabidopsis RS precursor with its transit 
peptide. 

SEQ ID NG:36 is the deduced amino acid sequence of the cloned cDNA encoding an arabidopsis RS precursor 
with Its transit peptide. 

55 SEQ ID NO:37 is tiie nucleotide sequence of a cloned cDNA encoding a Magnaporthe grisea LS. 

SEQ ID NO:38 is tiie deduced amino acid sequence of the cloned cDNA encoding Magnaporthe grisea LS. 
SEQ ID NO:39 is the highly conserved C-terminal amino acid sequence found in plant LS proteins. 
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[0026] Luminase synthase (LS) and nbofiavin synthase (RS). the ultimate and pentultimate enzymes of the spinach 
riboflavin biosynthetic pathway have been cloned by use of function complementation of E. coli auxotrophs. 
[0027] Nucleic add fragments that respectively encode LS protein and RS protein have been isolated from plants 
and fungi. LS and RS genes from other plants and fungi can now be identified by comparison of random cDNA 
5 sequences to the sequences provided by Applicants. The invention includes assays using these nucleic add fragments 
to screen for crop protection chemicals related to the enzymatic pathway and methods for alteraig the levels of produc- 
tion of LS and RS enzymes in a host cell. 

[0028] In this disclose* a numba* of terms and abbreviations are used. The following definitions are provided. 

[0029] "Lunrtazine synthase" is abbreviated as LS. 
10 [0030] '^Riboflavin synthase" is abbreviated as RS. 

[0031 ] ''Fla^^n mononucleotide" is atsbrevtated as FMN. 

[0032] "Flavin adenine dlnudeotide" is abbreviated as FAD. 

[0033] "Polymerase chain reaction" Is atjbreviated PGR. 

[0034] "Expressed sequence tag" is abbreviated EST. 
IS [0035] "Dimethyl sulfoxide" is abbreviated DMSO. 

[0036] •6,7-Dimethyl-8-(V-D-ribityl)Iuma2lne" is abbreviated DMRL 

[0037] "4-Rib{tylamino-5-amino-2.6-dihydroxypyrimidine" is abbreviated RAADP. 

[0038] "3,4-DihydrQxybutanone 4-phosphate" is abbreviated DHBR 

[0039] "lsopropyi-1 -thio-p-D-galactopyranoside" is abbreviated IPTG. 
20 [0040] "Sodium dodecylsulfate-polyacrylamide gel electrophoresis'* is abbreviated SOS-PAGE. 

[0041 ] "Open reading frame" is abbreviated ORF. 

[0042] An "isolated nudeic add fragment* is a polymer of RNA or DNA that is single- or double-stranded, optionally 
containing synthetic, non-natural or altered nucleotide bases. An isolated nucleic acid fragment in the form of a polymer 
of DNA may be comprised of one or more segments of cDNA, genomic DNA or synthetic DNA, 
25 [0043] "Mature" protein refers to a functional LS or RS enzyme without its transit peptide "Precursor" protein refers 
to the mature protein with a native or foreign transit peptide. The temi transit peptide" refers to the amino terminal 
extension of a polypeptide, which is translated in conjunction with the polypeptide forming a precursor peptide and 
which is required for its uptake by organelles such as plastids or chioroplasts. 

[0044] "Auxotrophy" refers to the nutritional requirements necessary for growtii, sporulation and crystal production 
30 of the miaoorganlsm. For the purpose of this invention, the term "auxotroph" is defined herein to mean an organism 
which requires the addition of riboflavin for growtii! 

[0045] The terms "host cell" and "host organism" refer to a cell capable of receiving foreign or heterologous genes 
and expressing those genes to produce an active gene product. Suitable host cells indude microorganisms such as 
bacteria and fungi, as well as plant ceils. 
35 [0046] The terms "lumazine synthase" or "LS" are used interchangeably with "6,7-dimetiiyl-8-ribityllumazine syn- 
thase" and refer to a plant or fungal enzyme that catalyzes the condensation of 3,4-dihydroxy-2-butanone 4-phosphate 
with 4-ribitylamino-5-amino-2,6-dihydroKypyrin™dine (RAADP) to yield ortiiophosphate and 6.7-<limethyl-8-(V-D-ribityl)- 
lumazine (DMRL). 

[0047] The terms "riboflavin synthase" or "RS" refer to a plant or fungal enzyme that catalyzes the dismutation of 
40 6,7-dimethyl-8-(V-D-ribityl)-luma2ine (DMRL) to yield riboflavin and 3.4<lihydroxy-2-butanone 4-phosphate (DHBP) 
with 4-ribitylamino-5-amino*-2,6-dlhydroxypyrimidlne (RAADP). 

[0048] The term "metabolic activity" refers to the normal cellular activity needed to support growtii. As used herein 
agents such as crop protection chemicals that will inhibit metabolic activity will also generally inhibit cell growth. 
[0049] The term, "substantially similar" refers to nucleic acid fragments wherein changes in one or more nucleotide 

4S bases result in substitution of one or more amino acids, but do not affect the functional properties of tiie proton encoded 
by tiie DNA sequence. "Substantially similar" also refers to nudeic acid fragnrtents wherein changes in one or more 
nudeotide bases do not affect the ability of the nucleic acid fragment to mediate alteration of gene expression by anti- 
sense or co-suppression technology. "Substantially similar" also refers to modifications of the nudeic acid fragments of 
the instant invention such as deletion or insertion of one or more nucleotide bases that do not substantially affect the 

50 functional properties of tiie resulting transcript vis-^-vis the ability to mediate alteration of gene expression by antisense 
or co-suppression technology or alteration of tiie functional properties of tiie resulting protein molecule. It is therefore 
understood that the invention encompasses more than tiie specific exemplary sequences. 

[OOSO] A "substantial portion" refers to an amino add or nucleotide sequence which comprises enough of the amino 
add sequence of a polypeptide or the nudeotide sequence of a gene to afford putative identification of that polypeptide 
55 or gene, either by manual evaluation of the sequence by one skilled in tiie art or by computer-automated sequence 
comparison and identification using algorithn^ such as BLAST (Basic Local Alignment Search Tool; Aftschul et at., J. 
Mof.BioL 215:403-410 (1993); see also www.ncbi.nlm. nih.gov/BLAST/). In general, a sequence often or more contigu- 
ous amino acids or thirty or more nudeotides Is necessary in order to putatively identify a polypeptide or nudeic add 
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sequence as homologous to a known protein or gene. Moreover, with respect to nucleotide sequences, gene specific 
oligonucleotide probes comprising 20-30 contiguous nucleotides may be used in sequence-dependent mettiods of 
gene identification (&g.. Southern hybridization) and isolation (e.g.. in situ hybridization of bacterial colonies or bacte- 
riophage plaques}. In addition, short oligonucleotides which may be a portion of the sequence desired (generally 12 

5 bases or longer) may be used as amplification primers in PCR in order to obtain a particular nucleic acid fragment com- 
prising the primers. Accordingly, a ^'substantial portion" of a nucleotide sequence comprises enough of the sequence to 
afford specific identification and/or isolation of a nucleic add fragment comprising the sequence. The instant specifica- 
tion teaches partial or complete amino add and nucleotide sequences encoding one or more particular plant proteins. 
The skilled artisan, having the benefit of the sequences as r^orted herein, may now use all or a substantial portion of 

10 the disclosed sequences for the purpose known to those skilled in the art Accordingly, the instant Invention comprises 
the complete sequences as reported in the accompanying Sequence Listing, as well as substantial portions of those 
sequences as defined above and any protein fragments which retain enzymatic activity, and any nucleic acid 
sequences which encode protein fragments which retain enzymatic activity. 

[0051] For example, it is well known in the art that antisense suppression and co*suppression of gene e)q3ression 
75 may be accomplished using nucleic acid fragments representing less tiian the entire coding region of a gene, and by 
nucleic acid fragments that do not share 1 00% identity with the gene to be suppressed. Moreover, alterations in a gene 
which result in the production of a chemically equivalent amino acid at a given site, but do not effect the functional prop- 
erties of tiie encoded protein, are well known in the art Thus, a codon for the amino acid alanine, a hydrophobic amino 
acid, may be substituted by a codon encoding another less hydrophobic residue, such as glycine, or a more hydropho- 
20 bic residue, such as valine, leucine, or isoleucine. Similarly, changes which result in substitution of one negatively 
charged rB&due for another, such as aspartic add for glutamic acid, or one positively charged residue for another, such 
as lysine for arginine, can also be expected to produce a functionally equivalent product Nucleotide changes which 
result in alteration of the N-terminal and C-termiral portions of the protein molecule would also not be expected to alter 
the activity of the protein. Each of tiie proposed modifications is well witiiin tiie routine skill in the art. as is determination 

6 of retention of biological activity of the encoded products. 

[0052] Moreover, the skilled artisan recognizes tiiat substantially similar sequences enconpassed by this invention 
are also defined by tiieir ability to hybridize, under sti-ingent conditions (0.1X SSC. 0.1% SDS. 65 ''C). with tine 
sequences exemplified herein as well as sequences which, but for the degeneracy of the code would hybridize under 
these conditions. Prefered substantially similar nucleic acid fragments of the instant invention are tiiose nucleic acid 
30 fragments whose DNA sequences are 80% identical to tiie DMA sequence of the nudeic acid fragments reported 
herein. More preferred nudeic acid fragments are 90% identical to tiie DNA sequence of tiie nucleic acid fragments 
reported herein. Most preferred are nudeic add fragments that are 95% identical to tiie DNA sequence of the nudeic 
acid fragments reported herein. 

[0053] Homologous sequences are those displaying at least 50 or 60%, preferably at least 70. 80 or 90% sequence 
35 homology. The invention also relates to nudeic add sequences which encode polypeptides having at least 70 or 72% 
identity witii the amino acid sequences of tiie invention, preferably at least 80. 85, 90 or 95% identity. 
[0054] A nudeic acid molecule is ''hybridizable" to another nucleic acid molecule, such as a cDNA. genomic DNA. 
or RNA. when a single stranded form of the nucleic acid molecule can anneal to the other nucleic add n%>lecule under 
the appropriate conditions of temperature and solution ionic strengtii. Hybridization and washing conditions are well 
40 known and exemplified in Sambrook. J.. Fritsch, E.F. and Maniatis, T. Molecular Cloning: A Laboratory Manual. Second 
Edition. Cold Spring Harbor Lafioratory Press, Cold Spring Hartjor (1989), particularly Chapter 11 and Tafc)le 11.1 
therein (entirely incorporated herein by reference). The conditions of temperature and ionic strength determine tiie 
"stringency" of the hybridization. For preliminary screening for homologous nucleic adds, low stringency hybridization 
conditions, corresponding to a T/n of 55*0, can be used. e.g. 5X SSC, 0.1% SDS. 0.25% milk, and no tomnamide; or 
45 30% fbrmamide, 5X SSC. 0.5% SDS. Moderate stringency hybridization conditions correspond to a higher Tm, e.g. 
40% formamide, with 5X or 6X SSC. 

[0055] Hybridization requires tiiat the two nucleic acids contain complementary sequences, although depending on 
the stringency of the hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing 
nud ic acids depends on the length of the nudeic acids and the degree of complementation, variables well known in 

so the art. The greater the degree of similarity or homdogy between two nucleotide sequences, the greater the value of 
Tm for hybrids of nucleic acids having those sequences. The relative stability (corresponding to higher Tm) of nucleic 
acid hybridizations decreases in tiie following order: RNA:RNA. DNAiRNA. DNA:DNA. For hybrids of greater than 100 
nudeotides in lengtii. equations for calculating Tm have been derived (see Sambrook et al.. supra. 9.50-9.51). For 
hybridizations witii shorter nudeic adds. i.e. oligonucleotides, tiie position of mismatches becomes more important, 

55 and the length of the oligonucleotide determines its specificity (see Sambrook et al.. supra. 1 1 .7-1 1 .8). In one embod- 
iment the lengtii for a hybridizable nudeic add is at least about 10 nudeotides. Preferable a minimum length for a 
hybridizabie nucleic add is at least about 15 nudeotides; more preferably at least about 20 nucleotides; and most pref- 
erably the lengtii is at least 30 nucleotides. Furtiiermore, tiie skilled artisan will recognize tiiat tiie temperature and 
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rice jTaJ^/s^:^^^^^ ? Jr^.e 

relates to any nucleic acid fragment that encoSlTor l^^^J^^T^ Accordingly, the Instant Invention 
LS or RS biosynthetic enzymes as set forttTn^lo ID ^Sl^ Ko^^^ 

NO:10. SEQ ID N0:12 or SEQ ID NO-38 Th^sS^^ L^In n '° ^0:8. SEQ ID 

hostcellln usage Of nudec^decSonstosp^t^^^^ ^^'.'bited by a specific 

expression in a host cell, it is desirable to d'Lsig^ ttfj gie^^'SS ^tZt^^^"^ " ^""^ '"^"^ 
quency of preferred codon usage of the host cell frequency of codon usage approaches the fre- 

" ^sJUTmr^I^Sefeq^e^^^^^^^ - polypepWe 

also means the degree of SJLcrTd^t^^ j!^^^ ""^^"9 sequences. In the art. Tdentfty 
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by known methods, including but not limited to tt^l^iS^ i^;^ ^ ^"^^^^ be readily calculated 

15 Oxford University Press t^S.^[^^S^f^T^^ SomPUtational Moler,.lf,r pipmgy (Lesk. A. M.. ed.) 
demic Press. New V^frS) (Sil!?2' A^2T^!?c^ 'nfbrmaf*^'; ^nr\ fiennme Prnip n^ (Smith. D. W.. ed.) Aca- 
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(1987): and Seouen.. A^Til^L g ^^Tf,"^^^^^ (vP" Heinje. G.. ed.) Academic Presi 

ferred meth odrto determineTeS^a'^/ ^g'S ^Z^Z^t^^l'^l'''^- (^^^>- 
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pai™rise alignments Wuplei. Asl £ZS penalty=1 1. gap length penalty=3; for 

example. 95% •identftvrtoT.XelL!^^^ ^^^'"9 ^ ""^leotide sequence having at l4st for 

deotR^e is identicalTth^rrce's'^u^^^^^^ 

mutations per each 100 nucleotides of the ^er^^onZ^fl^^^ sequence may include to five point 
having a nucleotide sequence m lit 9sS „ ?^ sequence. In other words, to obtain a polynucleotide 

the reference sequenS^ be dS^s^SJ^ T ^ ""''^^'^^ '^"®"<=®' "P to 5% <rf the nucleotides in 
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of a number of commercially available machines. Accordingly, the genes can be tailored for optimal gene expression 
based on optimization of nucleotide sequence to reflect the codon bias of the host cell. The skided artisan appreciates 
the likelihood of successful gene expression if codon usage is biased towards those codons favored by the host Deter- 
mination of preferred oodons can be based on a survey of genes derived from the host cell where sequence irrtormatm 
5 is available. 

[0060] "Gene' refers to a nucleic add fragment that expresses a ^ecific proteia including regulatory sequences 
preceding (5* non-coding sequences) and following (3* non-coding sequences) the coding sequence. 'Native gene' 
refers to a gene as found in nature with its own regulatory sequences. "Chimeric gene' refers to any gene that is not a 
native gene, comprising regulatory and coding sequences tiiat are not found together in nature. Accordingly, a chimeric 

10 gene may conrprise regulatory sequences and coding sequences that are derived from cfifferent sources, or regulatory 
sequences and coding sequences derived from the same source, but arranged in a manner different than that found in - 
nature. 'Endogenous gene" refers to a native gene in its natural location in the genome of an organism. A "foreign' gene 
refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. 
Faeign genes can comprise native genes Inserted into a non-native orgemism, or chimeric genes. A Iransgene' is a 

IS gene tiiat has been introduced into tiie genome by a transformation procedure. 

[0061] "Coding sequence' refers to a DNA sequence that codes for a specific amino acid sequence. "Regulatory 
sequences" refer to nucleotide sequences located upstream (5' non-coding sequences), witiiin, or downstream (3' non- 
coding sequences) of a coding sequence, and which influence the transcription. RNA processing or stability, or trans- 
lation of the associated coding sequence. Regulatory sequences may include promoters, translation leader sequences, 

20 irttrons, and poiyadenylation recognition sequences. 

[0062] "Promoter" refers to a DNA sequence capable of controlling the expression of a coding sequence or func- 
tional RNA. In general, a coding sequence is located 3' to a promoter sequence. The promoter sequence consists of 
proximal and more distal upstream elements, tfie latter elements often referred to as enhancers. Accordingly, an 
"enhancer" is a DNA sequence which can stimulate promoter activity and may be an innate element of tiie promoter or 

25 a heterologous element inserted to enhance the level or tissue-specifidty of a promoter. Promoters may be derived In 
their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, 
or even conprise synthetic DNA segments. It is understood by tiiose skilled in tiie art that different promoters may direct 
. the expression of a gene in different tissues or cell types, or at different stages of development, or in response to differ- 
ent environmental conditions. Promoters which cause a gene to be expressed in most ceil types at most times are com- 

30 monly referred to as "constitutive promoters'. New promoters of various types useful in plant cells are constantiy being 
discovered; numerous examples may be found in tine compilation by Okamuro and Goldberg, {Biochemistry of Plants 
15:1-82 (1989)). It is furtiier recognized that since in most cases tiie exact boundaries of regulatory sequences have 
not been completely defined. DNA fragments of different lengtiis may have identical promoter activity. 
[0063] The 'translation leader sequence" refers to a DNA sequence located between tiie promoter sequence of a 

35 gene and the coding sequence. The translation leader sequence is present in tiie fully processed mRNA upsti'eam of 
the translation start sequence. The translation leader sequence may affect processing of the primary franscript to 
mRNA. mRNA stability or translation effidency. Examples of translation leader sequences have been described (Turner 
etal.. Mol. Biotech. 3:225 (1995)). 

[0064] The "3* non-coding sequences* refer to DNA sequences located downstream of a coding sequence and 
40 include poiyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting 
mRNA processing or gene expr^lon. The poiyadenylation signal is usually characterized by affecting the addition of 
polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exempli- 
fied by Ingelbrecht et al., Plant Ceil 1 :671-680 (1989). 

[0065] "RNA franscript' refers to the product resulting from RNA polymerase-catalyzed franscription of a DNA 
45 sequence: VS/hen the RNA transcript is a perfect complementary copy of the DNA sequence, it is referred to as the pri- 
mary franscript or it may be a RNA sequence derived from posttranscriptional processing of the primary transcript and 
is refen-ed to as the mature RNA. "f^essenger RNA* (mRNA) refers to the RNA that is without introns and that can be 
translated into protein by the cell. "cDNA" refers to a double-stranded DNA that is complementary to and derived from 
mRNA. 'Sense* RNA refers to RNA franscript tiiat includes the mRNA and so can be translated into protein by tiie cell. 
so "Antisense RNA' refers to a RNA franscript tiiat is complementary to all or part of a target primary transcript or mRNA 
and that blocks the expression of a target gene (US 5,1 07.065). The complementarity of an antisense RNA may be witii 
any part of the specific gene franscript. i.e.. at tiie 5' non-coding sequence. 3' non-coding sequence, infrons. or the cod- 
ing sequence. "Functional RNA" refers to antisense RNA. ribozyme RNA. or other RNA that is not franslated yet has an 
effect on cellular processes. 

55 [0066] The term "operably linked" refers to tiie association of nucleic acid sequences on a single nucleic acid frag- 
ment so that tiie function of one is affected by tiie other. For example, a promoter is operably linked witii a coding 
sequence when it is capable of affecting tiie expression of tiiat coding sequence (i.e.. tiiat tiie coding sequence is under 
the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in sense 
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S„ T '^r^'^^ *° '^"^ °' expression of the LS or RS genes in plants for some applica- 

sfructed by linking the genes or gene fragments encoding parts of these enzymes to plant promter sequences ThS 

mL?™ V !!? ""^"^ '° P'^"* P^°"«** sequences. TTie co^uppression SsSisech" 

cens for produ<*on of the instant LS or RS protein are microbial hosts. Microbial expression systems and ^r^i^ 
vectors containing regulatory sequences that direct high level expression of fbr«g?ir?JS^ ^S^Tc^o 
skilled in the art. Any of these could be used to construct chimeric genes fbr production of Se S^J^ LS^STSs 

.5 s;?sr5^eTnrrs^irr'"^~«'^^ 

Sressio^n 0;'*!'^°^ the expression of the instant LS and RS enzymes include any cell capable 

S^l^rS » ^ T^!!-^- - ^P^'^"""^- Saccharomyces. Pichia. Candida, and Hansenula). membere <S the genS 

20 Sh^2*™'"''"''T*!'" 5^-°"-"^' SNgella). Methods for the tSsfor^^SonS 

ao such hoste and the expression of foreign proteins are well known in the art and examples of suitable orotocolsn^v be 

SS-at^: a°n;fr^A^t9 af ^^ ^^^^^^^ ^ ^ ^' rmMn.n .rSecorxf Ed^oX'Sr 

fSSiiw th!^***!? '^^^ fransfbrmation of suitable microbial host cells are well known in the art 

ISL rnJIL Z "^^^ '°?.*^'"' ^'''"'^ '^'^^■"9 transcription and translation of the rel^rt gene a 

IW)90] Initiafton control regions or promoteis. which are useful to drive expression of the genes encodina the RS or 

?^M?^r^^^^^^^ " P'^^ '"^'"^"S but not Bmited to CYCif HIS3 SJrSuo 

w • ADC1. TRP1. URA3. LEU2. ENO. TPI (useful for expression in SaSS^O^s) S 

Ojeftil for expressKjn in Piohia); and lac. trp. IP^. IPr. 17. tac. and trc (useful for ©pression in Em J<^Si^. 

nsa^x^rrr^r^in^^^^^ 

denved from the punfied instant plant proteins, or through random in vitro screening of chemical libraries LS and RS 
?^DlJ;',S*i;r,°'S"f" '"^r^^'^ P'^"^ ^"'^ microorganism, 

TS?^^ ^ ' 9™"PS tor a number of important redox enzymes. Consequently it isXiticioatS 

Tsrarc'^rprrtu^^^^^^^^ 

InH ^C!" *!! fras-^ents of the instant invention may also be used as probes for genet- 

Rr2^ maSLn^i • J"f*^^"^ """^^ may be used as restriction fragment length polymo^ism 

^ ^^'^^ °^ restriction^igested plant genomic DNA may be prob^ wfth STe^ete 
acKl fragments of the instant inventfon. The resulting banding patterns may then be subjected to gSJdcTralvi^ ts^n 
computer programs such as MapMaker (Lander et at. Genomics 1-174-181 l^i^ZZ^^^J^ ^ 

ScSof eS: • ^^"^^'^ bTuil^^pret^e^blTcXntg 

Srcro^Jte^fi^"^^^^^^ * ^«P^«^^*9 P-^«« progeny of a^S 

S i^u^«?n?hfn?ni^^ polymorphisms .s noted and used to calculate the position of the instant nucleic 

r0094] The productfon and use of plant gene-derived prebes fbr use In genetic mapping is described by Bernatzky 



13 



EP 1 010 760 A1 



and Tanksley {Plant Mol. Biol. Reporter 4:37-41 (1986)). Numerals publications describe genetic mapping of specif to 
cDNA clones using the methodology outlined above or variations thereof. For example, F2 interaoss popuiadons. back- 
cross populations, randomly mated populations, near isogenic lines, and other sets of individuals may be used for map- 
ping. Such methodologies are well known to those skilled in the art 
5 [0095] Nucleic add probes derived from the instant nucleic add sequences may also be used for physical mapping 
(i.e.. placement of sequences on physical maps; see Hoheise! et al.. Nonmammalian Genomic Analysis: A Practical 
Guide, pp. 319-346. Academic Press (1996). and references cited therein). 

[0096] In another embodiment nucleic add probes derived from the instant nudetc add sequence may be used in 
direct fluorescence in situ hybridization (FISH) mapping. Although cun-ent methods of FISH mapping favor ise of large 
10 clones (several to several hundred . kb), improvements in sensitivity may allow performance of FISH mapping using 
shorter probes. 

[0097] A variety of nucleic add amplif icatipn-based methods of genetic and physical mapping may be carried out 
using the instant nudeic add sequences. Examples include alleie-spedf ic amplification, polymorphism of PdR-ampli- 
fled fragments (CAPS), allele-specific ligation, nudeotide extension reactions. Radiation Hybrid Mapping and Happy 

75 Mapping. For these methods, the sequence of a nucleic add fragment is used to design and produce primer pairs for 
use in the amplification reaction or in primer extension reactions. The design of such primers is well known to those 
skilled in the art. In methods employing PCR-based genetic mapping, it may be necessary to identify DNA sequence 
differences between the parents of the mapping cross in the region corresponding to the instant nudeic add 
sequences. This, however, is generally not necessary for mapping methods. Such information may be useful in plant 

20 breeding in order to develop lines with desired phenotypes. 

EXAMPLES 

[0098] The present invention is further defined in the following Examples, in which all parts and percentages are by 
25 weight and degrees are Celsius, unless othenivise stated. It should be understood that these Examples, while indicating 
preferred embodiments of the invention, are given by way of iliustratbn only. From the above discussion and these 
Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from 
the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usage 
and conditions. 

30 [0099] Standard recombinant DNA and molecular cloning techniques used here are well known in the art and are 
described by Sambrook. J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Laboratory Press. Cold Spring HartDor. 1989; and by T. J. Silhavy, M. L. Bennan. and L W. Enquist. Experiments with 
Gene Fusions. Cold Spring Hartjor Laboratory Press. Cold Spring, N.Y. (1984) and by Ausubel, F M. et aL. Current Pro- 
tocols in Molecular Biology, pub. by Greene Publishing Assoc. and Wiley-lntersdence (1987). 

35 [0100] Manipulations of genetic sequences were accomplished using the suite of programs available from the 
Genetics Computer Group Inc. (Wisconsin Package Version 9.0, Genetics Computer Group (GCG), Madison, Wl). 
[0101] The meaning of abbreviations is as follows: "sec" means second(s), "min" means minute(s), "h" means 
hour(s), "d" means day(s).'"nr means microliter, "mL" means milliliters, "L" means liters, "mM" means millimolar, "M" 
means molar, "mmoP means millimole(s). 

40 

EXAMPLE 1 

PQR-Clpninq qf QQli Lg ^p^ 

45 [0102] Gene specific PCR primers were used to amplify the E. coli LS and RS genes from genomic DNA, while 
adding unique restridion sites to their flanking regions for subsequent ligation into high copy number plasmids. The 
primers used for this purpose were based on the published DNA sequences of the E. coli LS and RS genes (GenBank 
accession numbers X64395 and X69109, respectively) and consisted of the following nucleotides: 

so Primer 1 - (SEQ ID NO:13): 

[0103] 

5'-CGA AGG AAG ACC at^ ATT ATT GAA GCT AAC GTT GC-3- 

55 

Primer 2 -(SEQ ID N0:14): 
[0104] 
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5'-ATC TTA CTg teg a cT TCA GGC CTT GAT GGC TTT C- 3' 

Primer 3 -(SEQ ID N0:15): 
5 [0105] 

S - ACT CAT TTA cca tgg OTA CGG GGA TTG TAC AGGGC- 3 ' 
Primer4-{SEQIDNO:16): 

10 

[0106] 

5*-ATG TTA CTg tcg acT TCA GGC TTC TGT GCC TGG TT- 3^ 

75 The underlined bases hybridize to the target genes, while lower case letters indicate the restriction sites (Ncol or Sail} 
that were added to the ends of the PGR primers. 

[0107] Ampliflication of the LS gene was achieved using Primers 1 and 2. and genomic DNA from E. coli strain 
W3110 (Campbell et al.. Proa Natl. Acad ScL 755276-2284 (1978)). Primer 1 hybridizes at the start o1 the gene and 
introduces a Ncol site at the protein's initiatton codon, while Primer 2 hydridizes at the opposite end and provides a Sail 

20. site just past the termination codon. The 1 0O-y} PCR reactions contained "100 ng of genomic DNA and both primers at 
a final concentration of 0.5 ^M. The other reaction conponenls were provided by the GeneAnp PCR Reagent Kit (Per- 
kin Elmer), according to the manufacturer's protocol. Amplification was carried out in a DNA Thermocycler 480 (PerWn 
Elmer) for 28 cycles; each comprising 1 min at 94 ''C. 2 min at 53 '^C. and 2 min at 72 ""C, Following the last cycle, there 
was a 7-min extension period at 72 ""C. The PCR product was cut with Ncol and Sail, and ligated into similariy digested 

25 pGEM-5Zf (+) (Promega. Madson, Wi). The latter was chosen as a suitable cloning vector since it lacks a Notl deavage 
site after double-digestion with Ncol and Sail (see below). The ligation reaction mixture was used to transform E. coii 
DH5a competant cells (QibcoBRL). and transformants were selected on LB media supplemented with 100 uQ/wL amp* 
idllin. 

[0108] The £ coli RS gene was amplified from genomic DNA in a similar manner using Primers 3 and 4. The 
30 former introduces a Ncol site at the protein's initiation codon. while the latter provides a Sail site just after the stop 
codon. Subsequent steps, including ligation of the PCR product into pGEM-5Zf (+) and transformation of DH5a with the 
resulting construct were exactly as described above. 

[0109] Plasmids harboring the cloned E. coll LS and RS genes were identified by restriction digestion analysis. 
Plasmid DNA was isolated from a number of ampicillin-resistant colonies using the Wizard DNA Purification System 
35 (Promega, Madison, WI) and subjected to cleavage with Ncol and Sail, the samples were analyzed by agarose gel 
electrophoresis, and a representative plasmid for each gene, yielding inserts of the correct size, was sequenced com- 
pletely to verify the absence of PCR errors. Apart from those nucleotides at the 5' and 3* ends that were intentionally 
altered for cloning purposes, the amplified E. cofi LS and RS gene sequences were identical to those reported in the 
literature. 

40 

EXAMPLE? 

Insertionai inactivation of the E. cofi LS and RS Genes 

[0110] In order to create bacterial auxotrophs tacking the ability to synthesize riboflavin, the cloned E. coii LS and 
RS genes were rendered nonfunctional through insertionai inactivation. Briefly, a unique Notl site was introduced in the 
middle of the coding region of each of the target genes, and a DNA fragment that confers Kanamycin resistance was 
ligated into the engineered sites. The latter was provided by the commerically available Kan^'GenBlock cartridge (Phar- 
macia), that was modified through PCR to add Notl cleavage sites at both of its ends. This modification was accom- 
plished using Primers 5 and 6 in a standard PCR reaction; the underlined portions hybridize to the Kan^ GenBlock, and 
lower case letters indicate the Notl cleavage sites. 

Primer5-(SEQIDNO:17): 

55 [0111] 

5'-AAC TAG ATC Age ggc cgc AGC CAC GTT GTG TCT CAA A- 3' 
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Primer 6 - (SEQ ID N0:18): 
[0112] 

5 ff^C AAA CAT Age ggc cac TGA GGT CTG CCT CGT GAA>3' 

Following amplification, the modified Kanf QenBlock was cleaved with Notl. and the resulting fragment was purified by 
agarose gel electrophoresis. 

[0113] PGR primers were also used to introduce a unique Notl cleavage site in the middle of the two target genes. 

10 This was accomplished through an application of the Inverse PGR" technique that is fully described by Ochman. et al. 
in PGR Protocols: A Guide to Methods and Applications. (Innis et al.. eds.) pp. 219-227. Academic Press, San Diego. 
OA. (1990). The targets for inverse PGR are usually double-stranded circular DNA nrK)lecules. However, in contrast to 
other PGR applications, the two primers are oriented away from each other such that their 3* ends are extended in oppo- 
site directions around the entire circular template. If the primers are designed to hybridize immediately adjacent to each 

15 other, a linear DNA fragment is produced that includes the entire vector sequence and has as its starting ard stopping 
points the original primer binding sites. The net result is analogous to linearizing a circular plasmid at a specified loca- 
tion. By attaching appropriate nucleotide sequences to the nonhybridiztng 5" ends of both PGR primers, it is therefore 
possible to introduce a unique restriction site at any desired location within a circxilar template. 
[0114] Primers 7 and 8 (which hybridize to nt 2273-2290 and nt 2243-2261 of the DNA sequence in GenBank 

20 accession number X64395. respectively) were designed to introduce a Notl cleavage site in the middle of the £ colt LS 
g ne: the nucleotides that hybridize to the target gene are underlined, and Notl cleavage sites are indicated in lower 
case letters. 

Primer 7- (SEQ ID N0:19): 

25 

[011SI 

S'-AAC TAG ATG Age ggc cgc OGT ACQ GTT ATT CGT t3GT-3' 
30 Primer8-LS(SEQIDNO:20): 
[01161 

5-GAC AAA GAT Age g oc cac GTC GTA TTT ACC GGT- 3' 

35 

Primers 9 and 1 0 (which hybridize to nt 121 7-1233 and nt 1 190-1208 of GenBank accession nuntber X69109. respec- 
tively) were used to introduce a Notl cleavage site in tiie middle of tiie E. coli RS gene. 

Primer9-(SEQ ID NO:21): 

40 

[01171 

5-AAC TAG ATG Age ggc cgc ACC ACT GCT GAA GTG GG- 3' 
45 Primer 10 - RS (SEQ ID N0:22): 
[01181 

ff-GAC AAA CAT Age ggc cgc GAG CTG ACA TTA AGT GTC C- y 

so 

[01 1 9] The circular templates for inverse PGR were the pGEM-5Zf (+) constructs containing the E, coli LS and RS 
genes. The 1 00-jil PGR reactions contained 0.5 ng of plasmid DNA and each of the appropriate primers at a final con- 
centration of 0.5 mM. Amplification was canied out in a DNA Thermocycler 480 (Perkin Elmer) for 30 cycles, each com- 
prising 50 sec at 94 ""C, 1 min at 55 •G. and 3 min at 72 **G. The PGR products were cleaved with Notl and the resulting 
55 fragments were purified by agarose gel electrophoresis; the excised bands were of the expected size. Next, tiie purified 
fragments were recircularized with T4 DNA ligase (Novagen) to regenerate functional plasmids, and aliquots of tiie liga- 
tion reaction mixtures were used to transform E, coli DH5a competent cells (GibcoBRL). Growth was selected for on 
IB media containing ampidllin (100 iig/mL), and plasmid DNA was isolated from a number of transformantsfor restric* 
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tfon digestion analysis with Notl. Sail, and Ncol. For each of the target genes, a representative plasmid yieMing the cor- 
rect cieawage patterns with these enzymes was selected for further manipulation. 

E?l^°^ . kanamycin resistance gene, the two plasmid constructs described above were cleaved with 

Notl and purified by agarose gel electrophoresis. Each of the fragments was then individuaUy incubated with a 4-fbld 
r?nS:^^ "'/S® "^"l^. !^"' *=**"9e. and sut^jected to a standard ligation reaction in the presence of 

»^-^^J^°'^3^^- °* ^ '^''^"^es were used to transfomi E. coli DH5a competant 

cens (GibcoBRL). and growth was selected for on LB plates containing kanamydn (30 Mg/mL) and ampiciUin (100 
MQ/mLY Plasmids hailDoring the dismpted £ coli LS and RS genes were identified by restriction digestion analysis The 
plasmids were deaved with Ncol and Sail, and were then subjected to agarose gel electrophoresis to check for the 
presenceofttie inserted kanamycin resistance gene. Representative plasmids, yielding fragments of tiie correct size 
were selected for further manipulation. DNA sequence analysis of these plasmids confirmed «iat the kanamydn resist- 
ance gene had been inserted at the correct location in both target genes. 

EXAMPLE .-^ 

Generation of E. coli LS and rs AinmimphQ 

[0121 ] The Insertionally inactivated E coli LS and RS genes were liberated from ttie plasmid constructs described 
above using Ncol and Sail and purified by agarose gel electrophoresis. Each of tfie fragments was ttien individuaHy 
intr^uced into El coli strain ATCC 47002 (fully described In Balbas et al.. Gene 1 36:21 1-213 (1993). and isogenic with 
JC7623 (descnbed by Bachmann, B.. in E. coli and S^lmnn^H^i.^;rr,..ri^ . m: Cdi.iiar anri Molecular Knlnnv rMi^ho»4f 
et al. eds.) p. 2466. American Sodety of Mterobiology. Washington. D.C. (1987)) by eledroporatation using a B7X 
Transfedor 100 (Biotedinologies and Experimental Research Inc.) accoixling to tfie manufacturer's protocol Th 
choice this strain as the initial redpient for gene replacement was based on Its well established hyper-rec phenotype 
and related ability to undergo high frequency double-crossover homologous recombination (Wyman et al.. Proa Nat 
Acad. Sc,. USA 82:2880-2884 (1985): Balbas et al.. Gene 136:211-213 (1993): Balbas etal.. Gene 172«5.69 (1996))' 
Thus. It was anticipated ttiat ttie insertionally inadivated £ coli LS and RS genes would effidentiy replace their func- 
tional chromosomal counterparts in ATCC 47002 under kanamydn seledion. 

[0122] Following electroporation. the ti-ansformed cells were resuspended in 1 .0 mL of S.O.C. media (GibcoBRU 
supplemented with riboflavin (400 ^g/mL). and incubated for 1 h at 37 'C. Kanamycin resistance was then 

l^TJ^A^^Jf, '^T^^ ""^'"^ ^ <^ and kanamycin (30 Hg/mL); colonies 

geared 24-48 h later. Phenotypic detection of the con-ed diromosomal integration event was accomplished ttirough 
'"fiu'^r®*"^ experiments. Riboflavin auxoti-ophs resulting from double-crossover homologous recombination of either 
of the disrupted target genes would be expeded to be resistant to kanamydn. sensitive to ampidRin, and to exhibit 
^'?'!!LT^x"lf ® presence of added riboflavin. Representative baderial cdonies exhibiting this phenotype were 
seleded for further study. «. r- 

[0123] While ATCC 47002 is an excellent strain for creating £ coli Knockouts", its multiple mutations in ttie recBCD 
loa render it incapable of propagating ColEI -type plasmids (Balbas et al.. Gene 1 72:65-69 (1 996)). Consequenfly ttie 
riljoflavin auxotrophs described above are not suitable for saeening plasmid cDNA libraries by fundional complerJien- 
tation in order to achieve ttiis goal it was therefore necessary to move ttie insertionally inactivated LS and RS genes 
from ttie diromosome of ATCC 47002 to a suitable wiWtype background. This manipulation was accomplished ttirough 
generalized phage transdudion using P1^, and standard mettiodologies as fully described by Miller J H in Exoeri- 
'"Tu^h!!!".'^'"" ^""^"^^ ^ ^""9 Uboratory. Cold Spring Haibor. NeuJ YorKOg^TE 

Sirii? ^^^tfl'lS- 75^76-2284(1978)) was seleded as ttie redpient strain for ttie 

insertionally inadivated LS and RS genes. Following phage transdudion. baderial growtti was seleded for on LB 
rnedia tt«t was supplemented witti kanamydn (35 ^g/mL) and ribdiavin (400 ng/mL). Stable transdudants harboring 
ttie disrupted £ col, LS or RS genes were ttien identified ttirough replicaplating experiments analogous to ttiose 
described above for ATCC 47002. Thus, individual colonies were patdied onto plates containing LB media, sodium clt- 
iT <7-5 mM). 'rognesium sulfate (1.5 mM). and kanamyadn (35 ^gAnL). witti or wWiout riboflavin (400 ug/mL) The 
tl^ f w? *"^*°P*^« for furttier study and subsequent complementation cloning (see 

below) were only able to grow in the presence d added nDdlavin. and were nd resistant to ampicillin (100 ug/mL) or 
streptomyan (25 ugAnL): sensitivity to stireptomydn is charaderistic of W31 1 0. but nd of ATCC 47002. 

EXAMPIF A 

Cloning pf gplnach LS gnd RS Genes Through Functional r.nmnlqTi antf,ti». ^ 

[01241 A spinadi {Spinacea oleracea) cDNA expression library in Umbda Zap II was obtained from Stratagena (La 
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Joiia, CA). and objected to mass excision according to the manufacturer's protocol. L^n excision, the liberated cDNA 
inserts are contained in the plasmid vector pBluscript wfdch confers resistance to anvpidllin. and allows their expres- 
sion in £ coli upon induction with isopropyl-1-thio-p-D-gaiactopyranoside (IPTQ). The resulting mixture of exdsed plas- 
mids was then electroporated into the £ coli LS and RS riboflavin auxotrophs (both derivatives of W311 0) using a BTX 

5 Transfector 1 00 (Biotechnologies and Experimental Research Inc.) and the manufacturer's conditions. The transformed 
cells were selected tor growth in the absence of added riboflavin, on plates that contained B agar (LB media containing 
sodium dtrate (7.5 mM), magnesium sut^te (1.5 mM), kanamyacin (35 ^g/h1L), ampidllin (100 ^g/mLj. and IPTG (0.6 
mM)). Following a 48-hr incubation period at 37 °C, t>acterial growth was observed and riboflavin-ind^endant colonies 
were recovered at frequencies of about 4.2 x 10'^ and about 3.1 x 10'^. for the £ coli LS and RS auxotrophs. respec- 

70 tively. For each of the target genes., plasmid DNA was isolated from a representative colony and subjected to further 
analysis; both of the selected plasmids were capable of transforming the appropiate £ coli auxotroph to riboflavin pro- 
totrophy at high frequency. The cDNA inserts contained in these plasmids were then sequenced completely on an ABl 
377 automated sequencer (Applied Biosystems), using fluorescent dideoxy terminators and custom-designed primers. 
[0125] The approximately 1.2 kbp spinach cDNA insert that rescued the £ coli RS auxotroph dearly encodes a 

15 riboflavin synthase. The nucleotide sequence of the open reading frame (ORF) for this protein and its predicted primary 
amino add sequence are set forth in SEQ ID NO:33 and SEQ ID NO:34. respectively. While there are unmistakable 
similarities between the doned plant protein and known microbial RS homologs, including those from £ coli, B. subtilis, 
P. feiognathi, P. phosphoreum and S. cerevisiae (GenBank accession numbers, X69109, X51510, M90094, L11391and 
221621, respectively), there are also some significant differences. The most obvious one is that the spinach RS is a 

20 much larger protein. In comparison to its counterparts in microorganisms, it possesses an additional 69 amino acids at 
its N-terminus (Rgure 1). This N-terminal polypeptide extension is relatively basic, rich in Ser and Thr residues, and as 
such, resembles a chloroplast transit peptide (Gavel et al., FEBS Lett. 261:455-458 (1990)). This observation sug- 
gested that the spinach RS is synthesized as a nuclear-ericoded precursor protein, and is subsequently targeted to 
chloroplasts where it is proteolytically processed to its mature form. Indeed, based on the sequence alignments shown 

25 in Figure 1 (generated with the Genetics Computer Group Pileup program), tiie predicted cleavage-site for maturation 
occurs between amino add residues 69 and 70 of the spinach RS precursor, giving rise to a mature polypeptide with a 
molecular mass of 22.8 kDa (SEQ ID N0:8). That this assignment is cored is strongly supported by the fact that all 
known microbial RS homologs start with tiie pentapeptide motif Met-Phe-Thr-Gly-lle. which is apparently critical for 
function (Santos et a!., J. Biol. Chem, 270:437-444 (1995)). 

30 [0126] The notion that mature spinach RS is localized in chloroplasts is also supported by experimental evidence. 
Thus, antibodies directed against the purified recombinant protein (See below) specifically interact with a polypeptide 
of the expected size when spinach chloroplast extracts are subjected to SDS-PAGE and Western analysis. Otiier exper- 
iments have clearly demonstrated that tiie spinach RS precursor is Imported into plastids where it is proeolytically 
cleaved to its mature form. In these studies, tfie full-lengtii spinach RS precursor was labeled with [S^-metiiionine 

35 through in vitro ti-anscription/translation of the cloned gene, and subjected to in vitro protein inport assays (Cline et al., 
J. Biol. Chem. 260. 3691-3696 (1988): Viitanen et al.. J. Biol. Chem. 263. 15000-15007 (1988)) using intact isolated 
spinach chloroplasts. 

[0127] Of the various microbial homologs that are shown in Figure 1 . the mature spinach RS (SEQ ID N0:8) shows 
the greatest similarity to tiie yeast protein at the primary amino acid sequence level (e.g.. approximately 47% identity) 

40 . In contrast, the mature spinach RS is approximately only 35%. 42%. 34%, and 40% identical to the con-esponding pro- 
teins from £ colif B. subtilis, P. feiognathi, and R phosphoreum, respectively Taking into account tiie N-terminal chlo- 
roplast targeting sequence that Is unique to the plant protein tine evolutionary divergence is even greater. 
[0128] Similar obsen^tions were made witii the spinach cDNA insert tiiat was capable of restoring riboflavin pro- 
totrophy to tiie £ coli LS auxotroph. In tiiis case, the approximately 1.3 kbp DNA fragment of the rescuing plasmid c6n- 

45 tained an ORF that encoded a much larger tiian normal LS homolog; the DNA sequence and predicted amino acid 
sequence of the latter are shown in SEQ ID N027 and SEQ ID NO:28, respectively Analogous to the situation witii tiie 
spinach RS precursor, the additional tengtii of tiie cloned spinach LS (relative to its homdogs in microorganisms, 
including yeast) is entirely attributable to an N-termtnal chloroplast transit peptide-like extension (Figure 2). Thus, at 
least the last two steps of higher plant riboflavin biosynthesis take place in chloroplasts. Although immunolocalization 

so and chloroplast protein import experiments (similar to tiiose described above for spinach RS) have demonstrated that 
the spinach LS precursor is also targeted to plastids. it is more difficult in this case to predict with certainty the exact 
start of the mature protein. Even amongst known microbial LS homologs it is apparent that tiie first 15-20 N terminal 
amino add residues are poorly conserved. Ne/ertheless, from the sequence alignments in Figure 2. it is likely that the 
critical cleavage event for maturation occurs between Ala66 and Va}67 of tiie spinach LS precursor, to yield a polypep- 

55 tide witii a predicted molecular mass of approximately 1 6,5 kDa. While this notion remains to be determined experimen- 
tally, tiie predicated amino acid sequence of tiie mature spinach LS based on this assignment is given In SEQ ID N0:2. 
Note that even without its chloroplast targeting sequence, tiie mature spinach LS is only 49%, 47%, 44%. 43% and 29% 
identical to its counterparts in £ coli, A pleuropneumoniae, B. substills, R phosphoreum and S. cerevisiae, respec- 
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tively {e.g., the other proteins shown in Rgure 2). 
EXAMPLES 

Expression of MaHira j^jnnr h LS anri RR ip p ^ ^if 



poses. Primer 12 (ScwS^S^^^TSI^^'S^^Si.^^ ?° '^"^ 

30 sequence as that shown in SEQ ID N02 «™ «™ en«we a polypeptide with the same pnmary amino add 

spinach RS construct and ^tr^li^^JSm<t^^. '^"^ """'^^ ^ and 12 for the mature 

plete(, to Che* to, PCR e.^, ana in bom cases, SST^ ""^ ""^ "™-» ««• com- 

we,ed.s,Se^h"n 1^"^ "^'"^ "°»*"« ^l"'* 



35 



40 



45 



19 

BNSDOCID: <EP 1010760A1 J_> 



EP1010 760A1 



EXAMPLES 

Purrf ication of Recombinant Mature Soinach RS 

5 [0133] An aliquot (0.5 mL) of E coli cell-free extract containing the recombinant spinach RS was rapidly thawed to 
room temperature, diluted 1 : 1 with deionized water, and filtered through a 0.2 \im Aaodisc filter (Qelman Sciences, Cat. 
No. 4192). The entire sample was then applied to a Mono Q HR 5/5 column (Phanmacia Biotech mc), preequilibrated 
at 25 ''C with Buffer Q (50 mM Tris-HCI, pH 7.7. 10 mM sodium sulfite, 1 mM EDTA). The column was developed at 1 .0 
miyn^n wHh a linear gradient (30 mL) of 0-1 .0 M NaCI (in Buffer Q), and 1 *mL fractions were collected. The position on 

10 the gradient where spinach RS elutes was determined by SDS-PAGE (Laemmii U., Nature 227:680-685 (1970)) using 
15% gels and Coomassie Blue staining. Subsequently, column fractions eluting between 0.167-0.233 M NaCI were 
pooled and concentrated in a Centricon-1 0 (Amicon Inc.) at 4 ""C to a f inal volume of 450 In the next step, half of this 
material was applied to a 7.5 x 600 mm TSK G3000SW gel filtration column (TOSOH Corp.) that was preequilibrated 
with Buffer Q containing 0.3 M NaCI. The column was developed at a flow rate of 1 .0 mUmin (25 **C), and highly purified 

IS spinach RS eluted between 15.2-1 6.2 min. The latter was kept on ice while the remaining half of the sample was proc- 
essed in an identical manner. The peak fractions from the two gel filtration columns were pooled, supplemented with 
glyc rol (5%), concentrated to 6.6 mg of protein/mL, and stored at -80 ""C Ibr subsequent use. The yield of purified pro- 
tein was 2.9 mg. corresponding to 13% of the total protein present in the cell-free extract Visual inspection of over- 
loaded Coomassie-stained gets suggested the final preparation was >95% pure. 

20 [0134] Edman degradation of the purified recombinant spinach RS revealed that its first 21 amino adds are kJenti- 
cal to those of the protein shown in SEQ ID N0:8. in accord with the PCR strategy that was employed in its construction. 
This further indicates that the recombinant protein's N-terminus remained intact during the purification procedure. As 
determined by electrospray ionization mass spectrometry, the protomer molecular mass of the purified recombinant 
spinach RS was 22808.3 daitons. a value that is in excellent agreement with that predicted from its DNA sequence 

25 (22807.26 daitons). Similar to the E coli (Bacher et al., J. Bio! Chem, 255:632-637 (1 980)) and yeast (Santos et al., J. 
BioL Chem. 270:437-444 (1995)) RS homologs. both of which are trimers in the native state, the recombinant spinach 
RS eluted djring analytical gel filtration with an apparent molecular mass of 65 kDa. More important, the mature plant 
protein is catalytically active. In the in vitro enzyme assay described below, the purified recombinant spinach RS exhib- 
ited a turnover number of approximately 0.08/sec at 25 '^C (based on protomer). By way of conparison, the reported 

30 turnover numbers for & cerevisiae (Santos et al., J. Biol, Chem. 270:437-444 (1 995)) and B. subtilis (Bacher et al.. J. 
Biol. Cfiem. 255:632-637 (1980)) RS. at 37 *C, are 0.13/sec and 0.33/sec. respectively. Assuming that the enzyme 
reaction is characterized by a Q10 (temperature coefficient) of at least 2, these observations suggest that the purified 
recombinant spinach RS is probably fully active. 

35 EXAMPLE 7 

Purification and Phvsical Pro perties of Recombinant Mature Spinach LS 

[0135] An aliquot (0.5 mL) of E. coll cell-free extract containing the recombinant spinach LS was rapidly thawed to 

40 room temperature, diluted 1 :1 with deionized water, and filtered through a 0.2 ^m Aaodisc filter (Qelman Sciences. Cat. 
No. 4192). The entire sample was then fractionated on a Mono Q HR 5/5 column, using the same buffers and conditions 
that were described atwe for recombinant spinach RS. The material eluting between 0.367-0.433 M NaCI was pooled, 
concentrated to 450 fiL, and subjected to gel filtration chromatography exactly as described above for the recombinant 
spinach RS. Highly purified spinach LS emerged from the sizing column as a sharp peak eluting between 10.15-10.85 

45 min, and this material was supplemented with glycerol (5%), concentrated to 12.1 mg of protein/mL, and stored at -80 
«C for subsequent use. The final yield of purified protein was 4.3 mg (nearly 22% of the total protein present in the cell- 
free extract) and the preparation was essentially homogeneous as judged fronrt Coomassie-stained gels. 
[0136] The nucleotide sequence of the mature recombinant spinach LS precficts a polypeptide of 16534.71 daitons. 
a value that is virtually identical to that which was obtained with the purified protein using electrospray ionization mass 

so spectrometry (16536.3 daitons). Assuming that plant LS forms a hollow, spherical particle, comprised of 60 identical 
subunits. like the £ coli (M6rtl et al.. J. Biol. Chem. 271:33201-33207 (1996)) and B. subtilis (Bacher et al., J. Biol 
Chem. 255:632-637 (1980)) homologs, its native molecular mass should be about 992 kDa. Indeed, during analytical 
gel filtration, the purified recombinant spnach LS exhibited an apparent molecular mass of about 823 kDa It would thus 
appear that the quaternary structure of this nlsof lavtn biosynthetic enzyme has been highly conserved in the evolution 

55 from bacteria to higher plants. Edman degradation confirmed that the first two N-terminal amino acid residues of the 
recombinant protein had been con-ectty altered through PCR from Val-Arg to Met-Asn as previously described. Despite 
these substitutions and removal of tiie chloroplast targeting sequence, the purified recombinant spinach LS still 
retained catalytic activity. At 25 **C. using the in vitro enzyme assay described below, its turnover number was 0.013/sec 
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(based on protomer). This value is in reasonable agreement with the turnover numbers reported for the purified E co//, 
yeast, and S. subtills enzymes (0.06/sec), which were all measured at 37 ""C (Kis et al.. Biochemistry 34:2883-2892 
(1995); MOrtI et al., J, BioL Chem, 271 :33201-33207 (1996)). Thus, it is very likely that the recombinant spinach LS is 
also fully active. 

5 

EXAMPLES 

Preparation of Substrates For RS and LS 

10 [0137] 6.7-Dimethyl-8-(1 -D-ribityOlunnazine (DMRL) was synthesized as previously described (Plaut G. W. E. and 
Harvey R. A., Methods in Ehzymology (McCcrmicK D,B and Wright LD.. eds.) vol. 18. part B. pp. 515-538. Acadentic 
Press, NY (1971)) and purified by HPLC on a C-18 column developed with a water to methanol gradient The purified 
material was taken to dryness in a rotovap and stored at -20 *>C for subsequent use. 

[0138] 4-Ribitylamino-5-amino-2.6-dihydroxypyrimidine (RAADP) was prepared from 4-ribitylamino-5-nitroso-2.6- 
75 dihydroxypyrimidine by catalytic hydrogenation (Plaut, G. W. E. and Harvey R, A.. Methods in Enzymoloov (McCor- 
mick. D.B and Wright LD.. eds.) vol. 18. part B, pp. 515-538. Academic Press. NY (1971)). The 40-mL reaction mixture 
contained 0.4 mmol of the latter compound, dissolved in 10 mM acetic acid, and 20 mg of 10% palladium on caibon. 
Following an overraght incubation period al 25 *>C (with gentle shaking, under 50 psi H2) the catalyst was removed by 
filtration and the filtrate containing RAADP was stored in aiiquots at -80 **C. 
20 [0139] 3,4-Dihydroxybutanone 4«phosphate (DHBP) was prepared enzynnatically from D-ribose 5-phosphate. The 
reaction mixture contained 50 mM Tris-HCI (pH 7.5), 20 mM MgClgr 20 mM D-ribose 5-phosphate, 10 units/mL phos- 
phoriboisomerase (Sigma, Cat Na P7434) and 0.3 units/mL £. coli DHBP synthase. After 2 h at 25 ''C. the reaction 
reached completion and aiiquots of the solution containing DHBP were stored at -80 ''C. 

25 EXAMPLE 9 

Riboflavin Svnthase Assays 

[0140] Riboflavin synthase assays were run using 1 -mL reaction mixtures containing 0.1 mM DMRL in 50 mM Tris- 
30 CI (pH 7.5) at 25 ^'C. Reactions were initiated by the addition of purified recombinant spinach RS and initial rates of the 
reactions were measured continuously at 470 nm. A molar extinction coefficient ( e ) of 9500 at 470 nm was used to cal- 
culate the formation of rilxrflavin (Plaut G- W. E. and Harvey. R. A.. Methods in Enzvmoloay (McCormick. D. B. and 
Wright L D., eds.) vol. 18. part B. pp. 515-538. Academic Press. NY (1971)). 

[0141] Inhibitor screens were carried out in 96-well plates. Potential inhibitory compounds were dissolved in dime- 
35 thy! sulfoxide (DMSO) (10 mg/mL) and then serially diluted with water to concentrations of 0.1 mg/mL in 1% aqueous 
DMSO. Reaction mixtures (0.21 mL total) contained 0.1 15 mL DMRL (0.035 mM) in 100 mM Tris-CI (pH 7.^ and 0.085 
mL potential inhibitory compound (0.1 mg/mL) at 25 •C. Before initiating reactions with enzyme, the absorbance at 470 
nm was recorded. Reactions were Initiated with 0.01 mL of purified recombinant spinach RS (0.28 mg/mL) and after a 
3 mm incubation at 25 °C the plates were read at 470 nm. The first absorbance reading is subtracted from tiie second 
40 to afford rates of riboflavin formation. Column 1 of the 96-well plates contained no compounds and the reactions tn 
these wells served as uninhibited controls. Compounds that reduced the rate of rit>oflavin formation (In comparison to 
the uninhibited oontrol reactions) were followed up with 1-mL confirmation assays where IC50*s were determined. 

EXAMPLE 10 

45 

Uima2;iPe Synthase Assays 

[0142] Lumazine synthase assays were run using 1 -mL reaction mixtures which contain 0.05 mM RAADP and 0,05 
mM DHBP in 50 mM Tris-HCI (pH 75) at 25 *C. Reactions were then initiated by tine addition of purified recombinant 
so spinach LS and initial rates of the reactions were monitored continuously at an absort>ance of 408 nm. A molar extinc* 
tion coefficient (e) of 10,000 at 408 nm was used to calculate the rate of fbmnation of DMRL (Plaut. Q. W. E. and Har- 
vey, R. A.. Methods in Enzymoloov (McConmick. D.B and Wright LD.. eds.) vol. 18. part B. pp. 515-538, Academic 
Press. NY (1971)). 

[0143] Inhibitor screens are carried out in 96-we!i plates. Potential inhibitory compomds are dissolved in 10% 
55 aqueous DMSO to a concentration of 1 mg/mL Reaction mixtures (0.21 mL total) are prepared by adding 0.005 mL 
potential inhibitory compound (1 .0 mg/mL) to 0.195 mL of a solution containing 0.05 mM DHBP, 0.05 mM RAADP in 50 
mM Tris-CI (pH 7.5) at 25 ^C. Reactions are tiien initiated with 0.01 mL of purified recombinant spinach LS (1 .6 mg/mL) 
and progress of the reactions is monitored continuously at 408 nm for 5 min. Column 1 of the 96-well plates contains 
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no test corrpounds and the rates of DMRL formation in these weils serve as uninhibited controls. Compounds that 
reduce the rates of DMRL formation (in comparison to the uninhlDtted controls) are followed up with 1-mL confirmation 
assays where ICSO's are determined. 

5 EXAMPLE 11 

Other Plant and Funoai RS and LS Genes 

[0144] Using ^e methodologies described in Example 4. several other plant and fungal LS and RS cDNAs have 

10 been cloned through functional complementation of the £. coli rSxTflavin auxotrophs. These include the nuclear- 
encoded precursor proteins for arabidopsis RS. tobacco LS, and arabidopsis LS and fuIHength mature RS and LS pro- 
teins from the rice blast fungus Magnaporthe grisea. The nucleotide sequences of Ihe ORFs for these proteins are 
respectively documented in SEQ ID NO:35. SEQ ID N029. SEQ ID SEQ ID N0:11 and SEQ ID NO:37, and 

their con-esponding amino acid sequences are given in SEQ ID NO:36. SEQ ID NO:30. SEQ ID NO:32. SEQ ID NO: 12. 

IS and SEQ ID NO:38. respectively. The Lamba cDNIA expression libraries from which ^e arabidopsis LS and RS genes 
and tobacco LS gene were doned are commercially available from Stratagene (Cat Nos. 937010 and 936002. respec- 
tively). The cDNA expression library from which the Magnaporthe grisea RS gene was obtained was contained in the 
Lanlbda ZipLox vector (GibcoBRL) between the Notl and Sail cleavage sites of the polylinker region, and was prepared 
from isolated mRNA using conventional methodologies (Mardatis). The cDNA expression library from which the Mag- 

20 naporthe grisea LS gene was obtained was cloned between the EcoRI and Xhol sites of the plasmrd vector pBluescript 
II SK (+). available from Stratagene. and was also prepared from isolated mRNA using standard procedures (Maniatis). 
As described above for cloning the spinach LS and RS precursor genes (as in Example 4), the various Lamba cDNA 
expression libraries were first subjected to mass excision to yield plasmid. cDNA libraries, which were then introduced 
into the £ coli LS and RS auxotrophs (W31 10 derivatives) via electroporation. Following this procedure, transformants 

25 were selected for growth in the absence of added riboflavin (on plates containing B agar), and the cDNA inserts of the 
rescuing plasmids were isolated and sequenced completely using custom-designed primers. 
[0145] As shown in Figure 3. the cloned spinach and arabidopsis RS are very similar. Both proteins are synthesized 
as larger molecular weight precursors with a chloroplast targeting sequence at their N-termlnus (boxed residues in Rg- 
ure 3). Although the transit peptides of the two plant species are of conparable length, they are highly divergent and 

3p bear Tittle resemblance to each other at the primary amino sequence level. In contrast, the mature spinach and arabi- 
dopsis RS are well conserved, with nearly 70% of their residues being identical. These observations are not surprising 
since most nuclear-encoded chloroplast proteins, including the precursor for the small subunit of ribuiose-1 ,5-bisphos- 
phate carboxylase-oxygenase (Mazur et a!., Nad, Acids Res. 13:2373-2386 (1985)), exhibit much greater species-to- 
spedes variation in their chloroplast targeting sequence than in the mature portion of the molecule. The two plant RS 

35 homologs also possess a number of polypeptide motifs that are not present in the bacterial and fungal RS homologs 
that Yme currently been sequenced. It is possible that some of these highly conserved regions that are unique to plants 
might specifically influence the catalytic and/or regulatory properties of the higher plant RS. thereby providing valuable 
insight in the design of enzyme inhibitors that could be useful as herbicides. 

10146] A comparison of the spinach, tobacco, and arabidopsis LS precursor proteins, also cloned by functional 
40 complementation, provides a similar picture (Rgure 4). Again, the chloroplast transit peptides of three precursors are 
poorly conserved (boxed residues), while the mature proteins exNbit 72-76% identity at the amino acid sequence level. 
Additionally, all three plant proteins possess a unique stretch of amino acid residues at their C-termlnus (e.g., ASLFE- 
HHLK [SEQ ID NO:39]), a region of the polypeptide that is highly divergent in microbial LS homologs. That these nine 
residues are identical for three different plant species, suggests that they might be of unique importance to the function- 
45 ality of the higher plant proteins. Based on tiiis observation, it is antidpated that further structural and mechanistic stud- 
ies with the purified plant LS proteins described herein will greatly assist the rational design of enzyme inhibitors that 
are specifically herbicidal. 

[0147] Until ^e present invention, the only fungal LS and RS homologs that have been sequenced are those of S. 
cervasiae (Garcia-Ramirez etal.. J. Biol. Cliem. 270:23801-23807 (1995)}; Santos et al., J, BioL Chem. 270:437-444 

so (1 995)) . These proteins bear little resemblence to the Magnaportfie grisea LS and RS proteins that were cloned in the 
present work by functional complementation. The two fungal RS homologs are only 47 % identical at the primary amino 
acid sequence level. While a similar degree of conservation is observed between the Magnaportfie grisea and S. 
cerevesiae LS proteins (51% identity), the former is significantly longer than other known microbial homologs. Moreo- 
ver, the additional length of this protein is not due to tiie presence of a cleavable N-temriinal targeting sequence as 

55 described above for the higher plant LS and RS precursors. Instead, it possesses a unique polypeptide segment that 
appears to have been inserted right about in the middle of the protein; based on sequence alignments with other LS 
homologs. the additional residues (consisting largely of Ser and Thr) conrespond to amino adds 74-104 of the Mag- 
naporthe grisea LS (SEQ ID NO:38). The significance of this observation is not yet understood, but it could have prac- 
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tical application in the development of novel fungicides for use in the treatment of rice blast 

SEOX^ENCE LISTING 
(1) GEHERAL INFORMATION; 

(i) APPLICANT: 

(A) ADDRESSEE: E. X. DU PONT DB NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITV: WILMINGTON 

(D) STATE : DELAWARE 

(E) COONTRY: UNITED STATES OF AMERICA 

(F) POSTAL CODE: 19898 

75 (ii) TITLE OF INVENTION: LDMA2INE SYNTHASE AND 

RIBOFLAVIN SYNTHASE 

(ill) NUMBER OF SEQUENCES: 39 

20 (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: WORDPERFECT €.1 



2S 



45 



SO 



(2) UTFORMATION FOR SEQ ID N0:1: 



(i) SEQUENCE CHTJy^CTERISTICS : 

<A} LENGTH: 471 base pairs 
<B> TYPE: nucleic acid 
^ KC) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDKA 

35 Uii) HYPOTHETICAL: NO ' 

(iv) ANTI- SENSE: NO 

(vt) ORIGINAL SOURCE: 

40 (C) INDIVIDUAL ISOLATE: mature spinach LS 

(Xi) SEQUtNCE DESCRIPTION: SEQ ID NO : 1 : 

GTTAGGGAGC TTGAAGGTTA TGTCACTJ\AA GCCCAGAGTT TCAGATTTGC CATTGTTGTG 60 
GCTAGGTTCA ACGAATTTGT GACAAGACGA CTAATGGAAG GAGCTCTTGA CACTTTTAAG 120 
AAATACTCTG TCAATGAAGA TATTGATGTT GTTTGGGTTC CTGGTGCTTA TGAGCTAGGT 180 
GTTACTGCAC AAGCACTTGG GAAATCAGGA AAATATCATG CTATTGTTTG TCTTGGAGCT 240 
GTGGTAAAAG GTGATACTTC ACACTATGAT GCTGTCGTTA ATTCTGCTTC CTCTGGAGTA 300 
CTGTCAGCTG GATTAAATTC AGGAGTACCT TGTGTCTTTG GTGTCCTTAC CTGTGATAAC 360 

55 
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15 



20 



25 



30 



.35 



40 



45 



SO 



ATGGATCAGG CCATAAATCG AGCTGGCGGG AAAGCGGGTA ATAAAGGAGC CGAGTCAGCG 420 
CTAACAGCTA TTGAAATGGC TTCGCTTTTC GAACATCATT TGAAGGCCTA A 471 
<2) INFORMATION FOR SEQ ID NO:2: 

(1) SEQUENCE CR;IRACT£RISTICS : 

(A) LENGTH: 1S6 amino acids 

(B) TYPE: amino acid 

(C) STRANDCDNESS : unknown 

(D) TOPOLOGY : imknown 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: mature spinach LS 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Val Arg Glu Leu Glu Gly Tyr Val Thr Lys Ala Gin Ser Phe Arg Phe 
15 10 15 

Ala lie Val Val Ala Arg Phe Asn Glu Phe Val Thr Arg Arg Leu Met 
20 25 30 

Glu Gly Ala Leu Asp Thr Phe Lys Lys Tyr Ser Val Asn Glu Asp lie 
35 40 45 

Asp Val Val Trp Val Pro Gly Ala Tyr Glu Leu Gly Val Thr Ala Gin 
50 55 60 

Ala Leu Gly Lys Ser Gly Lys Tyr His Ala He Val Cys Leu Gly Ala 
65 70 75 80 

Val Val Lys Gly Asp Thr Ser His Tyr Asp Ala Val Val Asn Ser Ala 
85 90 95 

Ser Ser Gly Val Leu Ser Ala Gly Leu Asn Ser Gly Val Pro Cys Val 
100 , 105 - 110 

Phe Gly Val Leu Thr Cys Asp Asn Met Asp Gin Ala He Asn Arg Ala 
lis 120 125 

Gly Gly Lys Ala Gly Asn Lys Gly Ala Glu Ser Ala Leu Thr Ala He 
130 135 140 

Glu Met Ala Ser Leu Phe Glu His His Leu Lys Ala 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 3: 



55 
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(1) SEQUENCE CHARACTERISTICS: 

(A) liEKGTH: 474 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : doiibl e 
<D) TOPOLOGY: linear 

(ii> MOLECOl^ TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

75 <C) INDIVIDUAL ISOLATE: macure tobacco LS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTTCGTCAGT TGACTGGTTC TGTTACCTCT GCCAAAGGCC ATCGCTTTGC TGTTGTGGTT 60 
GCAC6TTTCA ATGATCTTAT CACCAAGAAG CTTTTGGAGG GAGCTTTGGA CACTTTCAAA 120 
AATTACTCTG TTAGA6AGQA AGATATTGAT GTC6TGTGGG TTCCTGGCTG TTTTGAAATC 180 
GGTGTGGTTG CGCAACAGCT TGGAAA6TCG CAGAAATATC AAGCAATACT CTGTATTGGG 240 
GCTGTGATTA GAGGTGATAC GTCTCACTAT GATGCCGTCG TTAATGCTGC CACATCCGGA 300 
GTACTTTCAG CAGGTCTAAA TTCTGGTACT CCTTGCATAT TTGGTGTTTT GACATGTQAT 360 
^ ACCTTGGA6C AGGCTTTCAA TCGTGTCGGT GGGAAGGCTG GGAATAAAGG TGCCGAAACA 420 

GCGTTGACAG CTATTGAGAT GGCGTCTTTG TTTGAACACC ACTTAAAGGC TTAA 474 

(2) INFORMATION FOR SEQ ID NO:4: 

35 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 157 amino acids 
{B) TYPE: amino acid 
(C) STRANDEDNESS : noH relevant 
40 (D) TOPOLOGY: not relevauat 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL S0T3RCE: 

(C) INDIVIDUAL, ISOLATE: mature tobacco LS 

(xi) SEQUENCE DESCH^IPTION: SEQ ID N0:4: 

Val Arg Gin Leu Thr Gly Ser Val Thr Ser Ala Lys Gly His Arg Phe 
15 10 15 

Ala Val Val Val Ala Arg Phe Asn Asp Leu He Thr Lys Lys Leu Leu 
20 25 30 

ss 
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Glu Gly Ala Leu Asp Thr Phe Lys Asn Tyr Ser Val Arg Glu Glu Asp 
35 40 45 

lie Asp Val Val Trp Val Pro Gly Cys Phe Glu He Gly Val Val Ala 
50 55 60 

Gin Gin Leu Gly Lys Ser Gin Lys Tyr Gin Ala lie Leu Cys He Gly 
65 70 75 80 

Ala Val He Arg Gly Asp Thr Ser His Tyr Asp Ala Val Val Ash Ala 
85 90 95 

Ala Thr Ser Gly Val Leu Ser Ala Gly Leu Asn Ser Gly Thr Pro Cys 
100 105 110 

He Phe Gly Val Leu Thr Cys Asp Thr Leu Glu Gin Ala Phe Asn Arg 
115 120 125 

val Gly Gly Lys Ala Gly Asn Lys Gly Ala Glu Thr Ala Leu Thr Ala 
130 135 140 

He Glu Met Ala Ser Leu Phe Glu His His Leu Lys Ala 
145 150 155 

(2) XNFORMATIOIX FOR SSQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE': cDNA 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) IipXVIDUAL ISOLATE: mature arabidopsis LS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTTCGCCATG TTACGGGGTC TCTTATCAGA GGCGAAGGTC TTAGATTCGC CATCGTGGTA 60 
GCTCGTTTCA ATGAGGTTGT GACTAAGTTG CTTTTGGAAG GAGCGATTGA GACTTTCAAG 120 
AAGTATTCAG TCAGAGAAGA AGACATTGAA GTTATTTGGG TTCCTGGCAG CTTTGAAATT 180 
GGTGTTGTTG CACAAAATCT TGGGAAATCG GGAAAATTTC ATGCTGTTTT ATGTATCGGC 240 
GCTGTGATAA GAGGAGATAC CACACATTAT GATGCTGTTG CCAACTCTGC TGCGTCTGGA 300 
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GTACTTTCTG CTAGCATAAA TTCAGGCGTT CCATGCATAT TTGGTGTACT GACTTGCX3AG 360 
6ACATGGATC AGGCTCT6AA TCGATCTGGT GGCAAA6CCG 6CAATAAGGG A6CT6AAACT 420 

5 

GCTTTGACGG CGCTCQAAAT GGCGTCGTTG TTTGAGCACC ACCTGAAATA G 471 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEOtJENCE CHARACTERISTICS: 

(A) IiENGTH: 156 amino acids 

(B) TYPE: ' amino acid 

(C) 5TRANDEDNESS : tui3cnown 
(D> TOPOLOGY: unknown 

75 

<ii) MOLECULE TYPE: protein 

(Vi) ORIGINAL SOURCE: 

(C) XNDIVIDXIAL ISOLATE: mature arabidopsis LS 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

val Arg His Val Thr Gly Ser Leu lie Arg Gly Glu Gly Leu Arg Phe 
15 10 15 

Ala lie Val Val Ala Arg Phe Asn Glu Val Val Thr Lys Leu Leu Leu 
20 25 30 

Glu Gly Ala lie Glu Thr Phe Lys Lys Tyr Ser Val Arg Glu Glu Asp 
35 40 45 

lie Glu Val lie Txp Val Pro Gly Ser Phe Glu lie Gly Val Val Ala 
50 55 60 

^ Gin Asn Leu Gly Lys Ser Gly Lys Phe His Ala Val Leu C/s He Gly 

65 70 75 80 

Ala Val lie Arg Gly Asp Thr Thr His Tyr Asp Ala Val Ala Asn Ser 
85 SO 95 



20 
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Ala Ala Ser Gly Val Leu Ser Ala Ser He Asn Ser Gly Val Pro Cys 
loo > 105 110 

He Phe Gly Val Leu Thr Cys Glu Asp Met Asp Gin Ala Leu Asn Arg 
115 120 125 

Ser Gly Gly Lys Ala Gly Asn Lys Gly Ala Glu Thr Ala Leu Thr Ala 
130 135 140 

Leu Glu Met Ala Ser Leu Phe Glu His His Leu Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 7: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mature Spinach RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATGTTCACT6 GCATTGTTGA AGAGATTGGC CGAGTTAA6C AAATGGGTTA TGGCGAAGAC SO 
GGTGGATTTC AGCTTAAAGT TGTAGGAGAC ATTGTCCTAA AAGATGTCAA TCTTGGTGAC 120 
AGTATCGCAG TTAATGGTAC ATGTCTAACT GTGACGGAAT TTGACACTAA AGCGTCCGAA 180 
TTTACTCTTG GGATAGCGCC TGAGACGCTT AGGAAGACGG CATTGATGGA TCTCGAACCA 24 0 
GGGTCAGTTG TTAATTTAGA AAGAGCCCTT TTGCCTTCTA CACGGATGGG TGGTCACTTT 300 
GTCCAGGGAC ATGTTGATGG GACAGGAGAA ATTGTATCAC TAGTTGAAGA AGGTGATTCT 360 
TTGTGGGTCA AGATAAAAAC AAGCCCAGAA ATACTGAGAT ACATTGTACC AAAAGGGTTT 420 
ATTGCAATTG ATGGCACAAG TTTAACAGTG GTGGATGTGT TTGACCAAGA ATTATGCTTT 480 
AATATTATGT TAGTTGCTTA CACTCAACAA AATGTGGTCA TTCCACTCAA AAAAGTTGGC 540 
CAAAAGGTTA ATTTAGAGGT TGATATTCTA GGAAAATATG TGGAAAGGCT CCTAAGTAGT 600 
AGTGGGGTtT TGGATCCTAC CAAATTCACA TAG 633 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CTARACTERISTICS : 

<A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
CD) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOIATE: Mature Spinach RS 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO;8: 

Met Phe Thr Gly He Val Glu Glu He Gly Arg Val Lys Gin Met Gly 

Tyr Gly Glu Asp Gly Gly Phe Gin Leu Lys Val Val Gly Asp He Val 
20 25 30 

Leu Lys Asp Val Asn Leu Gly Asp Ser He Ala Val Asn Gly Thr Cvs 
35 40 45 

Leu Thr Val Thr Glu Phe Asp Thr Lys Ala Ser Glu Phe Thr Leu Gly 
50 55 60 

He Ala Pro Glu Thr Leu Arg Lys Thr Ala Leu Met Asp Leu Glu Pro 
" 70 75 80 

Gly Ser Val Val Asn Leu Glu Arg Ala Leu Leu Pro Ser Thr Arg Met 
fiS 90 95 

Gly Gly His Phe Val Gin Gly His Val Asp Gly Thr Gly Glu He Val 
100 105 no 

Ser Leu Val Glu Glu Gly Asp Ser Leu Trp Val Lys He Lys Thr Ser 
115 120 125 

Pro Glu He Leu Arg Tyr He Val Pro Lys Gly Phe He Ala He Asp 
130 135 140 

Gly Thr Ser Leu Thr Val Val Asp Val Phe Asp Gin Glu Leu Cys Phe 
"5 ISO 155 160 

Asn He Met Leu Val Ala. Tyr Thr Gin Gin Asn Val Val He Pro Leu 
165 170 175 

Lys Lys Val Gly Gin Lys Val Asn Leu Glu val Asp He Leu Gly Lys 
180 185 190 

Tyr Val Glu Arg Leu Leu Ser Ser Ser Gly Val Leu Asp Pro Thr Lys 
195 200 205 

Phe Thr 
210 

(2) INFORMATION FOR SEQ ID NO: 9: ^ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiAhle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 

(iv) ANTl- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) IMDIVIDX7AL ISOLATE: Mature arabidopsis RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTGTTTACTG GAATCGTGGA GGAAATGGGT GAAGTCAAGG ACTTGGGAAT GGCCGATCAC 60 
GGAGGATTCG ACCTCAAAAT CGGAGCGAGA GTGGTGTTAG AGGACGTGAA GCTCGGTGAC 120 
AGTATCGCCG TGAACGGTAC TTGTTTAACG GTGACGGAGT TTAACGCAGA GGAGTTCACA 180 
GTAGGGTTAG CACCGGAGAC GCTGAGAAAA ACATCGTTGG AGGAGTTAAA GAAAGGATCT 240 
CCGGTGAATC TGGAGCGTGC GTTGCAGCCA GTGAGCAGGA TGGGTGGACA CGTGGTTCAG 300 
GGACACGTGG ATOGGACGGG AGTGATTGAA TCAATGGAGG TAGAGGGTGA TTCTTTGTGG 260 
GTGAAGGTGA AAGCTGACAA GGGTTTGTTG AAATACATTG TGCCTAAAGG ATTTGTGGCT. 420 
GTTGATGGGA CTAGCTTGAC GGTTGTTGAT GTGTTTGATG AAGAGAGCTG TTTCAATTTC 480 
ATGATGATTG CTTATACGCA ACAGAATGTA GTGATTCCGA CTAAGAAGAT TGGGCAGAAA 540 
GTGAATCTTG AGGTTGATAT CATGGGGAAG TATGTTGAGA GGCTTCTCAC CAGTGGTGGC 600 
TTCTCCAAAG GAAAAGAAAA TATTTGA 627 
(2) INFORMATION FOR SBQ ID NO: 10: 

(i) SEQUENCE CKARACTERIoTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY : not relevant 

(ii) MOLECULE^ TYPE : protein 

(Vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Mature arabidposis RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Phe Thr Gly lie Val Glu Glu Met Gly Glu VaX Lys Asp Leu Gly 
15 10 15 

Met Ala Asp His Gly Gly Phe Asp Leu Lys lie Gly Ala Arg Val Val 
20 25 30 
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Leu Glu Asp Val Lys Leu Gly Asp Ser lie Ala Val Asn Gly Thr Cys 
35 40 45 

Leu Thr Val Thr Glu Phe Asn Ala Glu Glu Phe Thr Val Gly Leu Ala 
50 55 60 

Pro Glu Thr Leu Arg Lys Thr Ser Leu Glu Glu Leu Lys Lys Gly Ser 
65 70 75 80 

Pro Val Asn Lieu Glu Arg Ala Leu Gin Pro val ser Arg Mec Gly Gly 
85 90 95 

His Val Val Gin Gly His Val Asp Gly Thr Gly Val lie Glu Ser Met 
100 105 110 

Glu Val Glu Gly Asp Ser Leu Trp Val Lys Val Lys Ala Asp Lys Gly 
115 120 125 

Leu Leu Lys Tyr lie Val Pro Lys Gly Phe Val Ala Val Asp Gly Thr 
130 135 140 

Ser Leu Thr Val Val Asp Val Phe Asp Glu Glu Ser Cys Phe Asn Phe 
145 ISO 155 160 

Met; Met He Ala Tyr Thr Gin Gin Asn Val Val He Pro Thr Lys Lys 
165 170 175 

He Gly Gin Lys Val Asn Leu Glu Val Asp He Met Gly Lys Tyr Val 
180 185 190 

Glu Arg Leu Leu Thr Ser Gly Gly Phe Ser Lys Gly Lys Glu Asn He 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : do\lble 

(D) TOPOLOGY: linear 
« 

(ii) MOLECULE TYPE: CDKA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: M. grisea RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGTTCACTG GTATAGTCGA GGAGATCGGA GTCGTGGCCG AGCTCAACCC GCACGATGCC 
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ACTGGAGGGA CGTCATTGAC CATCTCGCTC CCGACGGGCA 6CAGCCTGCT CTCGGATTGC 120 
CACQACGGTG ATAGCATOGC CGTC3WICGGT GTGTGCCTSA CCGTCACATC CTTCACGCCQ 180 
ACGCAGTTCA CAGTCGGTGT TGCCCCGGAG ACGCTGCGCG TCACGGACCT GGGCGACCTG 240 
GTCAAGGACT CGCGCGTGAA CCTGGAGCGA GCCGTGCGCG CCGACACTCG CATGGGCGGT 300 
CACTTTGTAC AGGGCCACGT CGACACGACC GCCACCATAG CCGACAAGCA GGCAGATG6T 360 
* AACGCCGTCA CGATQCGGTT CAAGCCACGG GAGGGTAGCG ATGTGTTGAA GTACATCX3TG 420 
CGAAAGGGTT ATGTCGCATT GGACGGAACC AGCTTGACGG TTACTAAGGT CGACGACOCT 480 
GCCGGGTGGT GGGAGGTCAT GCTCATCGTT TACACGCAGG AACGTGTGGT CCTGGCGCAG 540 
AAGAACGTTG GTGATACTGT CAATGTCGAG GTTGACGTCT TGGCCAAGTA TGCTGAGAAG 600 
AGTATGGCTG GATACTTGAG CTCTCTCAAC AAGAGTGACG CATAA 645 
(2) INFORMATION FOR SEQ lO NO: 12: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: M. grisea RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Phe Thr Gly He Val Glu Glu He Gly Val Val Ala Glu Leu Asn 
15 - 10 . 15 

Pro His Asp Ala Thr Gly Gly Thr Ser Leu Thr He Ser Leu Pro Thr 
20 25 30 

Gly ser Ser Leu Leu Ser Asp Cys His Asp Gly Asp Ser He Ala Val 
35 .40 45 

Asn Gly Val Cys Leu Thr Val Thr Ser Phe Thr Pro Thr Gin Phe Thr 
50 55 60 

Val Gly Val Ala Pro Glu Thr Leu Arg Val Thr Asp Leu Gly Asp Leu 
65 70 75 80 

Val Lys Asp Ser Arg val Asn Leu Glu Arg Ala Val Arg Ala Asp Thr 
85 90 95 
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Arg Met Gly Gly His Phe Val Gin Gly His Val Asp Tbr Thr Ala Thr 
XOO 105 110 

lie Ala Asp l^ya Gin Ala Asp Gly Asn Ala Val Thr Met Arg Phe Lys 
115 120 125 

Pro Arg Glu Gly Ser Asp Val L.eu Lys Tyr lie Val Arg I*ys Gly Tyr 
130 135 140 

Val Ala I.eu Asp Gly Thr Ser Leu Thr Val Thr I*ys Val Asp Asp Ala 
145 150 155 160 

Ala Gly Trp Trp Glu Val Met Leu He Val Tyr Thr Gin Glu Arg Val 
les 170 175 

Val Leu Ala Gin Lys Asn Val Gly Asp Thr Val Asn Val Glu Val Asp 

180 185 190 

Val Leu Ala Lys Tyr Ala Glu Lys Ser Met Ala Gly Tyr Leu Ser Ser 
195 200 205 

Leu Asn Lys Ser Asp Ala 
210 

(2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDfiDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc - "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

*(C) INDIVIDUAL ISOLATE: 5' E. coli LS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGAAGGAAGA CCATGGCCAT TATTGAAGCT AACGTTGC 3 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECUIiE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIPUAL ISOLATE: 3* E. coli LS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATCTTACTGT CGACTTCAGG CCTTGATGGC TTTC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc » "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi> ORIGINAL SOURCE,; 

(C) INDIVIDUAL ISOLATE: 5' E, coLi RS primer 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO; 15: 

ACTCATTTAC CATGGCTACG GGGATTGTAC AGGGC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI^VESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

<A) DESCRIPTION: /desc « -primer" 

(iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 
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(vi) ORIGIiaAL SOURCE: 

(C> IKDIVIDUAL ISOIATE: 3» primer E. coli RS 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

ATCTTACTGT CGACTTCAGG CTTCTGTGCC TGGTT 35 
(2) INFORMATION FOR SEQ ID N0:17: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
IS (D) TOPOl-OGY: linear 

(ii) MOLECULE TYPE: otiher nucleic acid 

(A) DESCRIPTION: /desc « "primer" 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE; 

25 (C) INDIVIDUAL ISOLATE: 5» E. coli RS and LS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AACTAGATCA GCGGCC6CAG CCACGTTGTG TCTCAAA 37 

(2) INFORMATION FOR SEQ ID NO: 18: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: 3» E. coli LS and RS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GACAAACATA GCGGCCGCTG AGGTCTGCCT CGTGAA 36 

(2) INFORMATION FOR SEQ ID N0:19: 
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(i) SBQtTENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEOKESS : single 

(D) TOPOIOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc - "primer" 

<iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE; NO 



(vi) ORIGINAL SOORCE: 

(C) INDIVIDUAL ISOLATE: E. coli LS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



AACTAGATCA GCGGCCGCG6 TACGGTTATT CGTGGT 
(2) INFORMATION . FOR SSQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "primer" 

(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 

(vi) . ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: E. coli LS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GACAAACATA GC(3GCCGCGT CGTATTTACC GGT 

(2). INFORMATION FOR SEQ ID NO: 21: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: liixear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "primer" 
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(iii) HYPOTHETICAL: NO 

(iv) AKTZ -SSNSE: KO 

<vi) 0RI6IKAL SOURCE: 

(C) INDIVIDUAL ISOLATE: £. coli RS primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AACTAGATCA GCGGCCGCAC CACTGCTGAA GTGGC 

(2) INFORMATION FOR SEQ ID NO: 22: 

IS (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: E. coli RS primer 

^0 (xi) SEQUENCE DESCRI3?TI0N: SEQ ID NO: 22: 

GACAAACATA GO^GCCGCGA CCTGACATTA AGTGTCC 

(2) INFORMATION FOR SEQ ID NO:23: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid. 

(C) STRANDEDNESS: single 
^ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "primer" 
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(iii) HYPOTHETICAL: NO. 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: 5' primer spinach RS 

precursor 

(xi) SEQtXENCE DESCRIPTION: SSQ ID NO: 23: 
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CTACTCATTT CATATGTTCA CTGGCATTGT TGAA 
(2) INFORMATIOtr FOR SSQ 10 KO:24: 

(1) SEQUENCE CHAJiACTERISTICS : 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « •primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: 3» spinach RS precursor 

primer 

(xi) SEQUENCE DESCRIPTION: SEC ID NO: 24: 

CATCTTACTG GATCCACTAT GTGAATTTGG TAGGATC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLiECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc - "primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: 5* spinach LS precursor 

primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTACTCATTT CATATGAACG AGCTTGAAGG TTATGTCAC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "primer" 

<iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) ZNDZVIDT7AL ISOLATE: 3* primer spinach LS 

precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CATCTTACTG GATCCATCAfi GCCTTCAAAT GATGTTCG 38 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XiENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: spinach LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATOGCTTCAT TTOCAGCTTC TCAAACTTGT TTCCTGACAA CAAACCCCAC TTGTTTAAAA 60 
CCCAATTCCC CTCAAAAATC TTCCACATTT CTTCCATTTT CTGCCCCTCT TTCTTCCTCG 120 
TCATCTTTCC CTGGTTGTQG GTTGGTTCAT GTTGCATCAA ACAAGAAAAA TCGTGCTTCG 180 
TTTGTAGTGA CCAATGCTGT TAGGGAGCTT GAAGGTTATG TCACTAAAGC CCAGAGTTTC 240 
AGATTTGCCA TTGTTGTGGC TAGGTTCAAC GAATTTGTGA CAAGACGACT AATGGAAGGA 3 00 
GCTCTTGACA CTTTTAAGAA ATACTCTGTC AATGAAGATA TTGATGTTGT TTGGGTTCCT 360 
GGTGCTTATG AGCTAGGTGT TACTGCACAA GCACTTGGGA AATCAGGAAA ATATCATGCT 420 
ATTGTTTGTC TTGGAGCTGT GGTAAAAGGT GATACTTCAC ACTATGATGC TGTCGTTAAT 400 
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TCTGCTTCCT CTGGAGTACT GTCAGCTGGA TTAAATTCAG GAGTACCTTG TGTCTTTGGT 540 
GTCCTTACCT GTGATAACAT GGATCAGGCC ATAAATCGA6 CTGGCGGGAA AGCGGGTAAT 600 
AAAGGAGCCG AGTCAGCGCT AACAGCTATI GAAATGGCTT CGCTTTTCGA ACATCATTTG .660 
AAGGCCTAA 669 
(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRAKDEDNBSS ; not relevant 

(D) TOPOLOGY: not relevant 

(li) MOLECUI£ TYPE: protein 

20 (vi) ORIGINAL SOURCE: 

(C) INDIVIDtXAL ISOLATE: spinach LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

25 Met Ala Ser Phe Ala Ala Ser Gin Thr Cys Phe Leu Thr Thr Asn Pro 

15 10 15 

Thr Cys Leu Lys Pro Asn Ser Pro Gin Lys Ser Ser Thr Phe Leu Pro 
20 25 30 



Phe Ser Ala Pro Leu Ser Ser Ser Ser Ser Phe Pro Gly Cys Gly Leu 
35 40 45 

Val His Val Ala Ser Asn Lys Lys Asn Arg Ala Ser Phe Val Val Thr 
50 55 60 

Asn Ala Val Arg Glu Leu Glu Gly Tyr Val Thr Lys Ala Gin Ser Phe 
65 70 75 80 

Arg Phe Ala He Val Val Ala Arg Phe Asn Glu Phe Val Thr Arg Arg 
85 90 95 

Leu Met Glu Gly Ala Leu Asp Thr Phe Lys Lys Tyr Ser Val Asn Glu 
100 105 * 110 

Asp He Asp Val Val Trp Val Pro Gly Ala Tyr Glu Leu Gly Val Thr 
lis 120 125 

Ala Gin Ala Leu Gly Lys Ser Gly Lys Tyr His Ala He Val Cys Leu 
130 135 140 

Gly Ala Val Val Lys Gly Asp Thr Ser His Tyr Asp Ala Val Val Asn 
145 150 155 160 
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Ser Ala Ser Ser Gly Val lieu Ser Ala Gly Leu Asn Ser Gly Val Pro 
165 170 175 

Cys Val . Phe Gly Val Leu Thr Cys Asp Asn Met Asp Gin Ala He Asn 
180 185 190 

Arg Ala Gly Gly Lys Ala Gly Asn Lys Gly Ala Glu Ser Ala Leu Thr 
195 200 205 

Ala lie Glu Met Ala Ser Leu Phe Glu His His Leu Lys Ala 
210 215 220 

(2) INFORMATION FOR SEQ ID N0:29: 

(1) SEQUENCE CHARACTBRISTICS : 

(A) XiENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: tobacco LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TTCGCTTTCG GACAGTGCAA TCTTCTACCT CGTACAACAA CTGTAAATCC CACACAACTG 60 
CACTCTCCTC TTTACTCTTT GTCTTTGCCT TTCCACAGAC AAAGCATAAC CTCTTCACCT 12 0 
GCACTATCAT TCACCCAATC TCAAGGTTTA GGGTCTGCAA TTGAGAGACA TTOCGACCGG 180 
TCG6ATCTGT TTCAAACATG TGCTGTTCGT CAGTTGACTG GTTCTGTTAC CTCTGCCAAA 240 
GGCCATCGCT TTGCTGTTGT GGTTGCACGT TTCAATGATC TTATCACCAA GAAGCTTTTG 300 
GAGGGAGCTT TGGACACTTT CAAAAATTAC TCTGTTAGAG AGGAAGATAT TGATGTCGTG 360 
TGGGTTCCTG GCTGTTTTGA AATCGGTGTG GTTGCGCAAC AGCTTGGAAA GTCGCAGAAA 420 
TATCAAGCAA TACTCTGTAT TGGGGCTGTG ATTAGAGGTG ATACGTCTCA CTATGATGCC 480 
GTCGTTAATG CTGCCACATC CGGAGTACTT TCAGCAGGTC TAAATTCTGG TACTCCTTGC 540 
ATATTTGGTG TTTTGACATG TGATACCTTG GAGCAGGCTT TCAATCGTGT CGGTGGGAAG 600 
GCTGGC3AATA AAGGTGCCGA AACAGCGTTG ACAGCTATTG AGATGGCGTC TTTGTTTGAA 660 
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CACCACTTAA AGGCTTAA 
(2) INFORMATION FOR SEQ ID NO: 30; 

Ci) SEOXJENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: tobacco LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Phe Ala Phe Gly Gin Cys Asn Leu Leu Pro Arg Thr Thr Thr Val Asn 
15 10 IS 

Pro Thr Gin Leu His Sex Pro Leu Tyr Ser Leu Ser Leu Pro Phe His 
20 25 30 

Arg Gin Ser lie Thr Ser Ser Pro Ala Leu Ser Phe Thr Gin Ser Gin 
35 40 45 

Gly Leu Gly Ser Ala lie Glu Arg His Cys Asp Arg Ser Asp Leu Phe 
50 55 60 

Gin Thr Cys Ala val Arg Gin Leu Thr Gly ser val Thr ser Ala Lys 

€5 . 70 .75 80 

Gly His Arg Phe Ala Val val Val Ala Arg Phe Asn Asp Leu He Thr 
85 90 95 

Lys Lys Leu Leu Glu Gly Ala Leu Asp Thr Phe Lys Asn Tyr Ser Val 
100 105 110 

Arg Glu Glu Asp He Asp Val Val Trp Val Pro Gly Cys Phe Glu He 
115 120 125 

Gly Val Val Ala Gin Gin Leu Gly Lys Ser Gin Lys Tyr Gin Ala He 
130 135 140 

Leu Cys He Gly Ala Val He Arg Gly Asp Thr Ser His Tyr^ Asp Ala 
14i5 150 155 160 

Val Val Asn Ala Ala Thr Ser- Gly Val Leu Ser Ala Gly Leu Asn Ser 
165 170 17S 

Gly Thr Pro Cys He Phe Gly Val I-eu Thr Cys Asp Thr Leu Glu Gin 
180 185 190 
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Ala Phe Asn Arg Val Gly Qly Vys Ala Gly Asn Lya Gly Ala Glu Thr 
195 200 205 

Ala Leu Thr Ala lie Glu Met Ala ser Leu Phe Glu His His Leu Lys 
210 215 220 

Ala 
225 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) . SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDMA 

(iii) HYPOTHETICAL: NO 

(iv) ANTX- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: arabidopsis LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATGAAGTCAT TAGCTTCGCC GCCGTGTCTC CGCCTGATAC CGACGGCACA CCGTCAGCTC 60 
AATTCGCGTC AATCTTCCTC CGCCTGTTAT ATACACGGTG GCTCTTCTGT GAACAAATCC 120 
AATAATCTCT CATTCTCCTC ATCCACATCC GGATTTGCGT CACCACTAGC TGTAGAGAAG 180 
GAATTACGCT CTTCATTCGT ACAGACGGCT GCTGTTCGCC ATGTTACGGG GTCTCTTATC 240 
AGAGGCGAAG GTCTTAGATT CGCCATCGTG GTAGCTCGTT TCAATGAGGT TGTGACTAAG 300 
TTGCTTTTGG AAGGAGCGAT TGAGACTTTC AAGAAGTATT CAGTCAGAGA AGAAGACATT 360 
GAAGTTATTT GGGTTCCTGG CAGCTTTGAA ATTGGTGTTG TTGCACAAAA TCTTGGGAAA 420 
TCGGGAAAAT TTCATGCTGT TTTATGTATC GGCGCTGTGA TAAGAGGAGA TACCACACAT 480 
TATGATGCTG TTGCCAACTC TGCTGCGTCT GGAGTACTTT CTGCTAGCAT AAATTCAGGC 540 
GTTCCATGCA TATTTGGTGT ACTGACTTGC GAGGACATGG ATCAGGCTCT GAATCGATCT 600 
GGTGGCAAAG CCGGCAATAA GGGAGCTGAA ACTGCTTTGA CGGCGCTCGA AATGGCGTC6 660 
TTGTTTGAGC ACCACCTGAA ATAG 684 
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(2) IMFOIOIATXON FOR S£Q ID KO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOIiECULE TYPE: protein 

(vi) ORIGINAL S0X3RCE: 

(C) INDIVIDUAL ISOLATE: arabidopsis LS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Lys Ser Leu Ala Ser Pro Pro Cys Leu Arg Leu lie Pro Thr Ala 
15 10 15 

His Arg Gin Leu Asn Ser Arg Gin Ser Ser Ser Ala Cys Tyr lie His 
20 25 30 

Gly Gly Ser Ser Val Asn Lys Ser Asn Asn Leu Ser Phe Ser Ser Ser 
35 40 45 



Thr Ser Gly Phe Ala Ser Pro Leu Ala Val Glu Lys Glu Leu Arg Ser 
50 55 60 

Ser Phe Val Gin Thr Ala Ala Val Arg His Val Thr Gly Ser Leu lie 
65 70 75 80 

Arg Gly Glu Gly Leu Arg Phe Ala lie Val Val Ala Arg Phe Asn Glu 
85 90 95 

Val Val Thr Lys Leu Leu Leu Glu Gly Ala lie Glu Thr Phe Lys Lys 
100 105 . 110 

Tyr Ser Val Arg Glu Glu Asp lie Glu Val lie Trp Val Pro Gly Ser 
115 120 125 

Phe Glu lie Gly Val Val Ala Gin Asn Leu Gly Lys Ser Gly Lys Phe 
130 * 135 140 

His Ala Val Leu Cys lie Gly Ala Val lie Arg Gly Asp Thr Thr His 
145 150 155 160 

Tyr Asp Ala Val Ala Asn Ser Ala Ala Ser Gly Val Leu Ser Ala Ser 
165 170 175 

lie Asn Ser Gly Val Pro Cys lie Phe Gly Val Leu Thr Cys Glu Asp 
180 185 190 

Met Asp Gin Ala Leu Asn Arg Ser Gly Gly Lys Ala Gly Asn Lys Gly 
195 200 205 
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Ala Glu Thr Ala Leu Thr Ala Leu Glu Met Ala Ser Leu Phe Glu His 
210 215 220 

His Leu Lys 
225 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) XiENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

20 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Spinach RS precursor 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATGGCACTTT CAACTTCACT CTCTTTAGTA TCTCCCAAAC TCTCTCAACA AAATCTCACA 60 

TTTTGCACCT TCAACAACCA ACCCTCCTCT TTAAATGGGC ATATCAAATT CAATCCAAAC 120 

30 

CTCAGAAACT CAGTCTCTAA ACTCTTTATC ACCACCCAAA ACACCCGATT CCTAAAATTT 180 

CGGTACGTAA GGAATCAAAT AAACTCCATG TTCACTGGCA TTGTTGAAGA GATTGGCCGA 24 0 

25 GTrAAGCAAA TGGGTTATGG CGAAGACGGT GGATTTCAGC TTAAAGTTGT AGGAGACATT 300 

GTCCTAAAAG ATGTCAATCT TGGTGACAGT ATCGCAGTTA ATGGTACATG TCTAACTGTG 360 

ACGGAATTTG ACACTAAAGC GTCCGAATTT ACTCTTGGGA TAGCGCCTGA GACGCTTAGG 420 

40 

AAGACGGCAT TGATGGATCT CGAACCAGGG TCAGTTGTTA ATTTAGAAAG AGCCCTTTTG 480 
CCTTCTACAC GGATGGGTGG TCACTTTGTC CAGGGACATG TTGATGGGAC AGGAGAAATT 540 

^ GTATCACTAG TTGAAGAAGG TGATTCTTTG TGGGTCAAGA TAAAAACAAG CCCAGAAATA 600 

CTGAGATACA TTGTACCAAA AGGGTTTATT GCAATTGATG GCACAAGTTT AACAGTGGTG 660 
GATGTGTTTQ ACCAAGAATT ATGCTTTAAT ATTATGTTAG TTGCTTACAC TCAACAAAAT 720 

so GTGGTCATTC CACTCAAAAA AGTTGGCCAA AAGGTTAATT TAGAGGTTGA TATTCTAGGA 780 

AAATATGTGG AAAGGCTCCT AAGTAGTAGT GGGGTTTTGG ATCCTACCAA ATTCACATAG 840 

55 
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(2) INFOHMATION FOR SEQ ID KO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLCXSY: not relevant 

(ii) MOLECULE TVPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Spinach RS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Ala Leu Ser Thr Ser Leu Ser Leu Val Ser Pro Lys Leu Ser Gin 
1 5 10 15 



Gin Asn Leu Thr Phe Cys Thr Phe Asn Asn Gin Pro Ser Ser Leu Asn 
20 25 30 

Gly His lie Lys Phe Asn Pro Asn Leu Arg Asn Ser Val Ser Lys Leu 
35 40 45 

Phe He Thr Thr Gin Asn Hir Arg Phe Leu Lys Phe Arg Tyr Val Arg 
50 55 60 



Asn Gin lie Asn Ser Met Phe Thr Gly He Val Glu Glu He Gly Arg 
65 70 75 80 

Val Lys Gin Met Gly Tyr Gly Glu Asp Gly Gly Phe Gin Leu Lys Val 
85 90 95 

Val Gly Asp He Val Leu Lys Asp Val Asn Leu Gly Asp Ser He Ala 
100 105 110 

Val Asn Gly Thr Cys Leu Thr Val Thr Glu Phe Asp Thr Lys Ala Ser 
115 120 125 

Glu Phe Thr Leu Gly He Ala Pro Glu Thr Leu Arg Lys Thr Ala Xjeu 
130 135 140 

Met Asp Leu Glu Pro Gly Ser Val Val Asn Leu Glu Arg Ala Leu Leu 
145 150 155 160 

Pro Ser Thr Arg Met Gly Gly His Phe Val Gin Gly His Val Asp Gly 
165 170 175 



Thr Gly Glu He Val Ser Leu Val Glu Glu Gly Asp Ser Leu Trp Val 
180 185 190 
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Lys lie Lys Thr Ser Pro Glu He I«eu Arg Tyr lie VaX Pro Lys Gly 
195 200 205 

Phe He Ala He Asp Gly Thr Ser Leu Thr Val Val Asp Val Phe Asp 
210 215 220 

Gin Glu Leu Cys Phe Asn He Met Leu Val Ala Tyr Tbr Gin Gin Asn 
225 230 235 240 

Val Val lie Pro Leu Lys Lys Val Gly Gin Lys Val Asn Leu Glu Val 
245 250 255 

Asp He Leu Gly Lys Tyr Val Glu Arg Leu Leu Ser Ser Ser Gly Val 
260 265 270 

Leu Asp Pro Thr Lys Phe Thr 
275 

20 (2) INFORMATION FOR SEQ ID N0:3S; 

(i) SSQUENCS CHARACTERISTICS: 

(A> LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRA19DEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
30 Uii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOtJRCE: 

^ (C) INDIVIDUAL ISOIATE: arabidopsis RS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATGATGGCGG CTCGTACTCA TTGTATCAAC CTTATCCCCA AAGTATGTCT TCCACAATCC 60 
TTCAGAACTG GAGAATCAGT GACTAATCTC AGATTTGATT GCGTCTCTAA GTCATCGAAG 120 
CTTTCTCTCA AGACATCATG TGGAAGATCA AGAACGCATC ACCGGAGGCA AAATCTCAGC 180 
ATCCGGTCCG TGTTTACTGG AATCGTGGAG GAAATGGGTG AAGTCAAGGA CTTGGGAATG 240 
GCCGATCACG GAGGATTCGA CCTCAAAATC GGAGCGAGAG TGGTGTTAGA GGACGTGAAG 300 
CTCGGTGACA GTATCGCCGT GAACGGTACT TGTTTAACGG TGACGGAGTT TAACGCAGAG 360 
SO GAGTTCACAG TAGGGTTAGC ACCGGAGACG CTGAGAAAAA CATCGTTGGA GGAGTTAAAG 420 

AAAGGATCTC CGGTGAATCT GGAGCGTGCG TTGCAGCCAG TGAGCAGGAT GGGTGGACAC 480 
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GTGGTTCAGG GACACGTGGA TGGGACGGGA QTGATTGAAT CAATOOAGGT AGAGGGTGAT 540 
TCTTT6TGG6 TGAAGGTGAA A6CTGACAA6 GGTTTGTTGA AATACATTGT GCCTAJUUSGA 600 

5 

TTTGTGGCTG TTGATGGGAC TAGCTTGACG GTTGTTGATG TGTTTGATQA A5AGAGCTGT €60 
TTCAATTTCA TGATGATTOC TTATACGCAA CAQAATGTAG TGATTCCGAC TAAQAAGATT 720 
GGGCAGAAAG TGAATCTTGA GGTTGATATC ATGGGGAAGT ATGTTGAGAG GCTTCTCACC 780 
AGTGGTGGCT TCTCCAAAGG AAAAGAAAAT ATTTGA 816 

(2) INFORMATION FOR SEQ ID NO: 36: 

IS 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY:, not relevant 

(ii) MOLECOLE TYPE: protein 

(vi) ORIGINAL SOtmCE: 

(C) INDIVIDUAL ISOLATE: arabidopsis RS precursor 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Met Ala Ala Arg Thr His Cys lie Asn Leu lie Pro Lys Val Cys 
15 10 15 

Xjeu Pro Gin Ser Phe Arg Thr Gly Glu Ser Val Thr Asn X^u Arg Phe 
20 25 30 

Asp Cys Val Ser Lys Ser Ser Lys Leu Ser Leu Lys Thr Ser Cys Gly 
35 40 45 

Arg Ser Arg Thr His His Arg Arg Gin Asn Leu Ser lie Arg Ser Val 
50 55 60 

Phe Thr Gly He Val Glu Glu Met Gly Glu Val Lys Asp Leu Gly Met 
65 70 75 80 

Ala Asp His Gly Gly Phe Asp Leu Lys He Gly Ala Arg Val Val Leu 
85 90 95 

45 Glu Asp Val Lys Leu Gly Asp Ser He Ala Val Asn Gly Thr Cys Leu 

100 105 110 

Thr Val Thr Glu Phe Asn Ala Glu Glu Phe Thr Val Gly Leu Ala Pro 
115 120 125 
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Glu Thr Leu Arg Lys Thr Ser Leu Glu Glu Leu Lys Lys Gly Ser Pro 
130 135 140 
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Val Asn I*eu Glu Arg Ala Leu Gin Pro Val Ser Arg Met Gly Gly His 
145 150 155 X€0 

Val Val Gin Gly His Val Asp Gly Thr Gly Val lie Glu Ser Met Glu 
165 170 175 

Val Glu Gly Asp Ser Leu Trp Val Lys Val Lys Ala Asp Lys Gly Leu 
180 165 190 

Leu Lys Tyr lie Val Pro Lys Gly Phe Val Ala Val Asp Gly thr Ser 
Ids 200 205 

Leu Thr Val Val Asp Val Piie Asp Glu Glu Ser Cys Phe Asn Phe Met 
210 215 220 

Met He Ala Tyr Thr Gin Gin Asn Val Val He Pro Thr Lys Lys He 
225 230 235 240 

Gly Gin Lys Val Asn Leu Glu Val Asp He Met Gly Lys Tyr Val Glu 
245 250 255 

Arg Leu Leu Thr Ser Gly Gly Phe Ser Lys Gly Lys Glu Asn He 
260 265 270 

(2) INFORMATION FOR SBQ ID NO: 37: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : dO\lble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: M. grisea LS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ATGCACACCA AAGGCCCGAC CCCGCAGCAG CACGACGGCT CCGCCCTGCG CATCGGCATC 60 
GTGCACGCGC GCTGGAACGA GACCATCATC GAGCCGCTTC TGGCCGGCAC AAAAGCCAAG 120 
CTGCTGGCCT GCGGCGTCAA GGAGTCCAAC ATAGTCGTGC AGAGCGTTCC GGGGTCGTGG 180 
GAGCTGCCAA TAGCCGTGCA GAGGCTCTAC TCCGCATCCC AGCTCCAAAC CCCAAGCTCC 240 
GGCCCATCTC TGTCGGCCGG CGACCTGCTC GGCTCCTCGA CCACAGATCT TACCGCGCTC 300 
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CCGACCACCA CTGCCTCATC CACCGGCCCC TTTGACGCCC TCATCGCCAT CGGCGTGCTA 360 
ATCAAGGGCG AGACGATGCA CTTTGAGTAC ATTGCCGATT CG6TCTCGCA CGGCCTGATG 420 
CGCGTACAGC TCGACACGGG CGTCCCAGTT ATCTTCGGCG TCCTAACAGT CCTGACCGAC 480 
GACCAGGCCA AGGCTCGTGC CGGCGTCATC GAGGGCAGCC ACAACCACGG CGAGGACTGG 540 
GGCCTGGCCG CCGTTGAGAT GGGTGTGCGC AGGAGGGATT GGGCTGCCGG GAAGACCGAG 600 
TGA 603 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: M. grisea LS 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Met Kis Thr Lys Gly Pro Thr Pro Gin Gin His Asp Gly Ser Ala Leu 
1 5 10 15 

Arg lie Gly lie Val His Ala Arg Trp Asn Glu Thr lie lie Glu Pro 
20 25 30 

Leu Leu Ala Gly Thr Lys Ala Lys Leu Leu Ala Cys Gly Val Lys Glu 
35 40 45 

Ser Asn lie Val Val Gin Ser Val Pro Gly Ser Trp Glu Leu Pro lie 
50 55 60 

Ala Val Gin Arg Leu Tyr Ser Ala Ser Gin Leu Gin Thr Pro Ser Ser 
65 70 75 80 

^ Gly Pro Ser lieu Ser Ala Gly Asp Leu Leu Gly Ser Ser Thr Thr Asp 

85 90 95 . 

Leu Thr Ala Leu Pro Thr Thr Thr Ala Ser Ser Thr Gly Pro Phe Asp 
100 105 110 



Ala Leu lie Ala lie Gly Val Leu lie Lys Gly Glu Thr Met His Phe 
115 120 125 
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Glu Tyr He Ala Asp Ser Val Ser His Gly Leu Met Arg Val Gin I*eu 
130 135 140 

Asp Thr Gly Val Pro Val lie Phe Gly Val Leu Thr Val Leu Thr Asp 
145 150 155 160 

Asp Gin Ala Lys Ala Arg Ala Gly Val He Glu Gly Ser His Asn His 
165 170 175 

Gly Glu Asp Trp Gly Leu Ala Ala Val Glu Met Gly Val Arg Arg Arg 
180 185 190 

Asp Trp Ala Ala Gly Lys Thr Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 39: 

^ (i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevauit 
(P) TOPOLOGY: not relevant 
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(ii) MOLECOLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 39: 

Ala Ser Leu Phe Glu His His X^eu Lys 
1 5 



Claims 

1. An isolated nucleic acid fragment comprising a sequence encoding a plant RS enzyme selected from the group 
consisting of : 



(a) an isolated nucleic acid fragment encoding all or a substantial portion of the amino acid sequence selected 
from the group consisting of SEQ ID NO:8 and SEQ ID NO: 10; 

(b) an isolated nucleic add fragment that is substantially ^milar to or is homologous to an isolated nucleic acid 
fragment encoding all or a substantial portion of the amino acid sequence selected from the group cdnsisting 

45 ofSEQIDNO:8andSEQIDNO:10; t 

(c) an isolated nudeic acid fragment encoding a polypeptide having at least 70% identity with the amino add 
sequence selected from the group consisting of SEQ ID N0:8 and SEQ ID NO:10; and 

(d) an isolated nucleic add fragment that is complementary to (a), (b) or (c). 

so 2. The isolated nudeic add fragment of daim 1 selected from the group consisting of SEQ ID N0:7 and SEQ ID NO:9. 

3. The isolated nucleic acid fragment of claim 1 or claim 2 encoding a plant RS enzyme, wherein the plant is spinach 
or arabidopsis. 

55 4. An isolated nudeic add fragment comprising a sequence encoding a fungal RS enzyme selected from the group 

consisting of: 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of the amino add sequence set forth 
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inSEQIDN0:12; 

(b) an isolated nucleic add fragment that is substantially similar to or is homologous to an teolated nucleic acid 
fragment encoding ail or is homologous to at least a substantial portion of the amino acid sequence set forth 
in SEQ ID NO: 12; 

5 (c) an isolated nucleic add fragment that is complementary to (a) or (b). 

5. The isolated nucleic add fragment of claim 4 as set forth in SEQ ID NO: 11. 

6. The isolated nucleic add fragment of daim 4 or claim 5 encoding a fungal RS enzyme obtained from Magnaporthe 
10 grisea. 

7. An isolated nudeic acid fragment comprising a sequence encoding a plant LS enzyme, selected from the group 
consisting of : 

IS (a) an isdated nudeic add fragment encoding all or a substantial portion of the amino acid sequence selected 

from the group consisting of SEQ ID N0:2, SEQ ID N0:4 and SEQ ID N0:6; 

(b) an isolated nucleic add fragment that is substantially similar to or is homologous to an isolated nucleic acid 
fragment encoding all or a substantial portion of the amino add sequence selected from the group consisting 
of SEQ ID NO:2, SEQ ID N0:4 and SEQ ID NO:6; 

20 (c) an isolated nudeic acid fragment encoding a polypeptide having at least 72% identity with the amino acid 

sequence selected from the group consisting of SEQ ID N0:2. SEQ ID N0:4 and SEQ ID N0:6; and 
(d) an isolated nudeic add fragment that is complementary to (a), (b) or (c). 

8. The isolated nudeic acid fragment of claim 7 selected from the group consisting of SEQ ID NO:1 . SEQ ID NO:3 
25 and SEQ ID NO:5. 

9. The isolated nucleic add fragment of claim 7 or claim 8 encoding a plant LS enzyme, wherein the plant is spinach, 
tobacco or arabidopsis. 

30 10. An isolated nucleic acid fragment comprising a sequence encoding a fungal LS enzyme selected from the group 
consisting of 

(a) an isolated nucleic acid fragment encoding all or a substantial portion of the amino add sequence set forth 
in SEQ ID NO:38: 

35 (b) an isolated nucleic add fragment that is substantially similar to or is homologous to an isolated nudeic acid 

fragment encoding all or is homdogous to at least a substantial pordon of the amino acid sequence set forth 
in SEQ ID NO:38; and 

(c) an isolated nucleic acid fragment that Is complementary to (a) or (b). 

40 11. The isolated nucleic acid fragment of claim 10 as set forth in SEQ ID NO:37. 

1 2. The isolated nudeic add fragment of claim 1 0 or claim 5 encoding a fungal LS enzyme obtained from Magnaporthe 
grisea. 

45 1 3. An isolated nucleic add fragment selected from the group consisting of: 

(a) an isolated nudeic acid fragment comprising a sequence encoding all or a substant'al portion of the amino 
acid sequence selected from the group consisting of SEQ ID NO:27, SEQ ID NO:29, SEQ ID N0:31, SEQ ID 
NO:33 and SEQ ID NO: 35; 

so (b) an isolated nudeic acid fragment comprising a sequence that is substantially similar to a isolated nucleic 

acid fragment encoding all or a substantial portion of the amino add sequence selected from the group con- 
sisting of SEQ ID NO:27. SEQ ID NO:29, SEQ ID N0:31. SEQ ID NO:33 and SEQ ID NO:35; and 
(c) an isolated nucleic acid fragment that is complemerrtary to (a) or (b). 

55 14. The isolated nucleic acid fragment of claim 13 comprising a sequence selected from the group consisting of SEQ 
ID NO:28. SEQ ID NO:30. SEQ ID NO:32. SEQ ID NO:34 and SEQ ID NO:36. 

15. A polypeptide encoded by the isolated nucleic acid fragment of any one of claims 1 to 14. 
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1 6. The polypeptide of claim 1 5 having at least 72% identity with the amino acid sequence selected from the group con- 
sisting of SEQ ID N0:2, SEQ ID N0:4 and SEQ ID NO: 6. 

17. The polypeptide of daim 15 selected from the group consisting of SEQ ID MO:2, SEQ ID N0:4 and SEQ ID NO:6. 

18. The polypeptide of claim 15 having the amino acid sequence as set forth in SEQ ID NO:38. 

1 9. The polypeptide of daim 1 5 having at least 70% identity with the amino acid sequence selected from the group con- 
sisting of SEQ ID NO:8 and SEQ ID NO:10. 

20. The polypeptide of ciaini 15 selected from the group consisting of SEQ ID NO:8 and SEQ ID NO:10. 

21 . The polypeptide of claim 1 5 having the amino acid sequence as set forth in SEQ ID NO: 12. 

75 22. The polypeptide of claim 15 selected from the group consisting of SEQ ID NO:27. SEQ ID NO:29, SEQ iD NO:31, 
SEQ ID NO:33 and SEQ ID NO:3& 

23. A chimeric gene comprising the isolated nucleic acid fragment of any one of claims 1 to 14 operably linked to suit- 
able regulatory sequences. 

20 

24. A transformed host cell comprislrig a host cell and the chimeric gene of daim 23. 

25. The transformed host cell of claim 24 wherein the host cell is a plant cell. 
25 26. The transformed host ceil of claim 24 wherein the host cell is E. colL 

27. A method of altering the level of expression of a plant or fungal LS enzyme or a plant or fungal RS enzyme in a host 
cell comprising: 

30 (a) transforming a host cell with the chimeric gene of claim 23; and 

(b) growing the transfamed host cell of step (a) under conditions that are suitable for expression of the chi- 
meric gene, 

resulting in production of altered levels of a plant or fungal LS or RS enzyme in the transformed host cell rela- 
tive to expressfon levels of a untransformed host cell. 

35 

28. A method of obtaining a nucleic acid fragment encoding all or a substantial portion of the amino add sequence 
encoding a plant LS enzyme comprising: 

(a) probing a cDNA or genomic library with the nucleic add fragment of any one of claims 7 to 9; 
40 (b) identifying a DNA clone that hybridises with the nucleic add fragment of any one of claims 7 to 9; and 

(c) sequencing the cDNA or genomic fragment that comprises the done identified In step (b). 

wherein the sequenced cDNA or genomic fragment encodes all or sutsstantially all of the amino add sequence 
encoding a plant LS enzyme. 

45 29. A method of obtaining a nucleic acid fragment encoding all or a substantial portion of the amino add sequence 
encoding a plant LS enzyme comprising: 

(a) syrrthestzing an oligonudeottde primer corresponding to all or a portion of the sequence selected from the 
group consisting of SEQ ID NO:1 . SEQ ID NO:3 and SEQ ID NO:5: and 
so (b) amplifying a cDNA insert present in a cloning vector using the oligonucleotide primer of step (a) and a 

primer representing sequences of the cloning vector. 

wherein the amplified cDNA insert encodes all or a portion of an amino acid sequence encoding a plant LS 
enzyme. 

£5 30. A method of obtaining a nucleic acid fragment encoding all or substantially all of tiie amino acid sequence encoding 
a fungal LS enzyme comprising 

(a) probing a cDNA or genomic library with tiie nucleic add fragment of any one of daims 10 to 12; 
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(b) identifying a DNA clone that hybridizes with the nucleic add fragment of any one of daims 10 to 12; and 

(c) sequendng the cDNA or genomic fragment that comprises the done identified in step (b), 

wherein the sequenced cDNA or genonvc fragment encodes ail or substantially all of the amino add sequence 
encoding a fungal LS enzyme. 

5 

31. A method of obtaining a nudeic add fragment encoding all or a substantial portion of the amino add sequence 
encoding a fungal LS enzyme comprisng 

(a) synthesizing an ofigonudeotide primer conesponding to all or a portion of the sequence set forth in SEQ 
10 ID IMO:37: and 

(b) amplifying a cDNA insert present in a doning vector using the oligonudeotide primer of step (a) and a 
primer r^resenting sequences of the cloning vector 

wherein the amplified cDNA Insert encodes all or a portion of an amino acid sequence encoding a fungal LS 
enzyme. 

75 

32. A method of obtaining a nucleic acid fragment encoding all or sut)s1antially all of the amino add sequence encoding 
a plant RS enzyme comprising: 

(a) prdsing a cDNA or genomic library with the nucleic add fragment of arty one of claims 1 to 3; 
20 (b) identifying a DNA clone that hybridizes with the nucleic add fragment of any one of daims 1 to 3; and 

(c) sequencing the cDNA or genomic fragment that comprises the clone identified in step (b)- 

wherein the sequenced cDNA or genomic fragment encodes all or substantially all of the amino acid sequence 
encoding a plant RS enzyme. 

25 33. A method of obtaining a nudeic acid fragment encoding all or a substantial portion of the amino add sequence 
encoding a plant RS enzyme comprising: 

(a) synthesizing an oligonudeotide primer corresponding to all or a portion of tiie sequence selected from the 
group consisting of SEQ ID NO:7 and SEQ ID N0:9: and 
30 (b) amplifying a cDNA insert present in a cloning vector using the oligonucleotide primer of step (a) and a 

primer representing sequences of the doning vector. 

wherein the amplified cDNA insert encodes all or a portion of an amino acid sequence encoding a plant RS 
enzyme. 

35 34. A method of obtaining a nucleic acid fragment encoding all or substantially all of the amino acid sequence encoding 
a fungal RS comprising 

(a) probing a cDNA or genomic library with tiie Isolated nucleic acid fragment of any one of claims 4 to 6: 

(b) identifying a DNA done that hybridizes witii the isolated nucleic add fragment of any one of claims 4 to 6; 
40 and 

(c) sequencing the cDNA or genomic fragment that conprises the clone identified in step (b). 

wherein the sequenced cDNA or genonvc fragment encodes ail or substsuitially all of the amino add sequence 
encoding a fungal RS. 

i 

45 35. A method of obtaining a nudeic acid fragment encoding all or a substantial portion of the amino add sequence 
encoding a fungal RS comprising 

(a) synthesizing an oligonudeotide primer corresponding to all or a portion of the sequence set forth in SEQ 
IDNO:11;and 

so (b) amplifying a cDNA Insert present in a cloning vector using tiie oligonucleotide primer of step (a) and a 

primer representing sequences of the cloning vector. 

wherein the amplified cDNA insert encodes all or a portion of an amino add sequence encoding a fungal RS 
enzyme. 

55 36. The product of the metiiod of any one of claims 28 to 35. 

37. An in vivo method for identifying as a herisicide candidate a chemical compound tiiat inhibits the activity of a plant 
LS enzyme encoded by tiie isolated nucleic acid fragment of any one of clairhs 7 to 9, the method comprising tiie 
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steps of: 

(a) disrupting the endogenous LS gene of a suitable microt)ial host cell, rendering growth of the microbial host 
ceil dependent on added riboflavin; 
5 (b) transforming the altered microbial host cell of step (a) with a chimeric gene comprising the isolated nucleic 

acid fragment of any one of claims 7 to 9 encoding a plant LS enzyme and operably linked to at least one suit- 
able regulatory sequence that allows its expression by the microbial host cell; 

(c) growing the transformed microbial host cell of step (b) under conditions that allow expression of the chi- 
meric plant LS gene; 

-io (d) contacting the transformed microbial host cell of step (c) while it is growing exponentially with a chemical 

compound of interest and in both the presence and absence of added riboflavin; 

(e) identifying as a heri3idde candidate the chenfiical compound of interest that inhibits growth of the trans- 
Ibrmed microbial host cell only when grown in the absence of added riboflavin. 

75 38. An in vitro method for identifying as an herbicide candidate a chemical compound that inhibits the activity of a plant 
LS enzyme encoded by the isolated nucleic acid fragment of any one of claims 7 to 9, the method comprising the 
steps of: 

(a) transforming a host cell with a chimeric gene comprising the isolated nucleic acid fragment of any one of 
20 claims 7 to 9 encoding a plant LS enzyme, the chimeric gene operably linked to at least one suitable regulatory 

sequence; 

(b) growing the transformed host cell of step (a) under conditions suitable for expression of the chimeric gene 
resulting in production of the plant LS enzyme; 

(c) purifying the plant LS enzyme expressed by the transformed host cell; 

25 (d) contacting tiie plant LS enzyme with a chemical compound of interest; and 

(e) identifying as a herbickie candidate the chemical compound of interest that reduces the activity of the plant 
LS enzyme relative to the activity of tiie plant LS enzyme in the absence of the chemical compound of interest. 

39. The nietiiod of daim 37 or claim 38 wherein the nucleic acid fragment is selected from the group consisting of SEQ 
30 ID NO: 1 , SEQ ID N0:3 and SEQ ID N0:5 and wherein ttie LS enzyme is selected from the group consisting of 

SEQ ID N0:2. SEQ ID N0:4 and SEQ ID NO:6. 

40. An in vivo metiiod for identifying as a fungicide candidate a chemical compound tiiat inhibits the activity of a fungal 
LS enzyme encoded by the isolated nucleic acid fragment of any one of claims 10 to 12. the method comprising 

3S tiie steps of 

(a) disrupting the endogenous LS gene of a suitable microbial host, rendering growtii of the microbial host 
dependent on added riboflavin; 

(b) transforming the altered microbial host cell of step (a) with a chimeric gene comprising a nucleic acid frag- 
40 ment of any one of claims 10 to 12 encoding a fungal LS enzyme and operably linked to at least one suitable 

regulatory sequence that allows its expression by the transformed microbial host cell; 

(c) growing the transformed microbiai host cell of step (b) under conditions suitatsle for expression of the chi- 
meric fungal LS gene; 

(d) contacting the transformed microbial host cell of step (b) while it is growing exponentially with a chemical 
45 compound of interest and in both the presence and absence of added riboflavin; and 

(e) identifying as a fungicide candidate the chemical compound of interest that inhibits growtii of tiie trans- 
formed microbial host cell only when grown in tiie absence of added riboflavin. 

41 . An in vitro method for identifying as a fungicide candidate a chemical compound that inhibits the activity of a fungal 
so LS enzyme encoded by tiie isolated nucleic acid fragment of any one of claims 10 to 12. the method comprising 

the steps of 

(a) transforming a host cell with a chimeric gene comprising a nucleic acid fragment of any one of dain« 10 to 
12 encoding a fungal LS enzyme, the gene operably linked to at least one suitable regulatory sequence; 
ss (b) growing tiie transformed host cell of step (a) under conditions suitable for expression of the chimeric gene 

resulting in the production of the fungal LS enzyme; 

(c) purifying tiie fungal LS enzyme expressed by tiie transformed host cell; 

(d) contacting tiie fungal LS enzyme witii a chemical compound of interest; and 
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(e) identifying as a fungicide carKiidate the chennical compound of interest that reduces the activity of the fun- 
gal LS enzyme relative to the activrty of the fungal LS enzyme in the absence of the chemical conpound of 
interest. 

5 42. The method of claim 40 or claim 41 wherein the isolated nucleic acid fragment corresponds to the sequence in SEQ 
ID NO:37 and wherein the fungal LS eizyme fragment con-e^nds to the sequence in SEQ ID NO:38. 

43. An in vh/o method for identifying as a herbicide candidate a chemical compound tiiat inhibits tiie activity of a plant 
RS enzyme encoded b/ tiie isolated nucleic acid fragment of any one of claims 1 to 3. tiie metiiod comprising the 

10 steps of: . . 

(a) disrupting the endogenous RS gene of a suitable microbial host rendering growth of tine microbial host 
dependent on added riboflavin; 

(b) transforming the altered microbial host cell of step (a) with a chimeric gene comprising the nucleic acid frag- 
15 ment of any one of claims 1 to 3 encoding a plant RS enzyme and operably linked to at least one suitable reg- 
ulatory sequence that allows its expression by the transformed microbial host cell; 

(c) growing tiie t-ansfbrmed microbial host cell of step (b) under conditions ttiat allow expression of tiie chi- 
meric plant RS gene; 

(d) contacting the transformed microbial host cell of step (c) while it is growing exponentially with a chemical 
20 compound of interest and in both tiie presence and absence of added riboflavin; and 

(e) identifying as an herbicide candidate the chemical compound of interest that inhibits growth of the trans- 
formed microbial host cell only when grown in tiie absence of added ritsof lavin. 

44. An in vitro metiiod for identifying as a herisicide candidate a chemical compound ttiat inhibits tiie activity of a plant 
25 RS enzyme encoded by the nucleic add fragment of any one of claims 1 to 3, the method compri^ng the steps of: 

(a) transforming a suitable microbial host cell witii a chimeric gene comprising tiie nucleic acid fragment of any 
one of claims 1 to 3 encoding a plant RS enzyme, tiie chimeric gene operably linked to at least one suitable 
regulatory sequence; 

30 (b) growing tiie f ansformed micrdbial host cell of step (a) under conditions suitable for expression of tiie chi- 

meric gene resulting in the production of tiie plant RS enzyme: 

(c) purifying the plant RS enzyme expressed by tiie transformed microbial host cell; 

(d) contacting the plant RS enzyme with a chemical compound of interest; and 

(e) identifying as a herisicide candkiate the chemical compound of interest tiiat reduces the activity of tiie plant 
35 RS enzyme relative to the activity of the plant RS enzyme in the absence of tiie chemical compound of interest 

45. The method of claim 43 or claim 44 wherein the nucleic acid fragment is selected from tiie group consisting of SEQ 
ID NO:7 and SEQ ID NO:9 and wherein the RS enzyme is selected from tiie group consisting of SEQ ID NO:8 and 
SEQ ID NO: 10. 

40 

46. An in vivo metiiod for identifying as a fungicide candidate a chemical compound tiiat inhibits the activity of a fungal 
RS enzyme encoded by the isolated nucleic acid fragment of any one of claims 4 to 6, tiie metiiod comprising tiie 
steps of: 

45 (a) disrupting the endogenous RS gene of a suitable microbial host rendering growth of tiie microbial host 

dependent on added riboflavin: 

(b) transforming the altered microbial host cell of step (a) with a chimeric gene comprising a nucleic acid frag- 
ment of any one of claims 4 to 6 encoding a fungal RS enzyme and operably linked to at least one suitable reg- 
ulatory sequence tiiat allows its expression by tiie transformed microbial host cell; 

50 (c) growing the ti-ansfbrmed microbial host ceil of step (b) under conditions tiiat allow expression of tiie chi- 

meric fungal RS gene; 

(d) contacting the transformed microbial host cell of step (c) while it is growing exponentially with a chemical 
compound of interest and in both the presence ard absence of added riboflavin; and 

(e) Identifying as a fungicide candidate the chemical compound of interest that inhibits growtii of tiie trans- 
55 formed host cell only when grown in the absence of added riboflavin. 

47. An in vitro metiiod for identifying a chemical compound tiiat inhibits the activity of a fungal RS enzyme encoded by 
the isolated nucleic acid fragment of any one of claims 4 to 6, the metiiod comprising tiie steps of 
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(a) transforming a host cell with a chimeric gene comprising an isolated nucleic acid fragment of any one of 
claims 4 to 6 encoding a fungal RS enzyme, the gene operably linked to at least one suitable regulatory 
sequence; 

(b) growing the transformed host cell of step (a) under conditions suitable for expression of the chimeric gene 
5 resulting in the production of the fungal RS enzyme; 

(c) purifying the fungal RS enzyme expressed by tiie transformed host cell; 

(d) contacting tfie fungal RS enzyme with a chemical contpound; and 

(e) identifying as a fungicide candidate tiie chemical compound of interest that reduces the activity of the fun- 
gal RS enzyme relative to tiie activity of the fungal RS enzyme in tiie absence of tiie chemical compound of 

70 interest 

48. The method of claim 46 or claim 47 wherein tiie Isolated nucleic acid fragment corresponds to the sequence in SEQ 
ID NO: 1 1 and wherein tiie fungal RS enzyme fragment corresponds to tiie sequence in SEQ ID N0:12. 

75 49. A plant containing the chimeric gene of daim 23. 

50. A seed derived from the plant of daim 49. 
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E. coli 
B. subt. 
p. leio. 
p. phos, 
S. cere, 
spinach 



E. coli 
B. subt, 
P. leio, 
p. phos. 
S- cere, 
spinach 



E. coli 
B . subt . 
p. leio, 
phos. 
S, cere, 
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